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To Donna 


Preface 


This book presents the basic theory of fields, starting more or less 
from the beginning. It is suitable for a graduate course in field theory, 
or independent study. The reader is expected to have absorbed a serious 
undergraduate course in abstract algebra, not so much for the material 
it contains but for the oft-mentioned mathematical maturity it provides. 

The book begins with a preliminary chapter (Chapter 0), which is 
designed to be quickly scanned or skipped and used as a reference if 
needed. The remainder of the book is divided into three parts. 

Part 1, entitled Basic Theory, begins with a chapter on polynomials. 
Chapter 2 is devoted to various types of field extensions. In Chapter 3, 
we treat algebraic independence, starting with the general notion of a 
dependence relation and concluding with Luroth’s Theorem on 
intermediate fields of a simple transcendental extension. Chapter 4 is 
devoted to the notion of separability of algebraic extensions. 

Part 2 of the book is entitled Galois Theory. Chapter 5 begins with 
the notion of a Galois correspondence between two partially ordered 
sets, and then specializes to the Galois correspondence of a field 
extension, concluding with a brief discussion of the Krull topology. In 
Chapter 6, we discuss the Galois theory of equations. In Chapter 7, we 
take a closer look at a finite field extension E of F as a vector space 
over F. The next two chapters are devoted to a fairly thorough 
discussion of finite fields. MGbius inversion is used in a few brief spots 
in these chapters, so an appendix has been included on this subject. 

Part 3 of the book is entitled The Theory of Binomials. Chapter 10 
covers the roots of unity (that is, the roots of the binomial x" —1) and 
includes Wedderburn’s theorem (a finite division ring is a field). This 
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also seems like the appropriate time to discuss the question of whether a 
given group is the Galois group of a field extension. In Chapter 11, we 
characterize the splitting fields of binomials x" —u, when the base field 
contains the n-th roots of unity. Chapter 12 is devoted to the question 
of solvability of a polynomial equation by radicals. (This chapter might 
make a convenient ending place in a graduate course.) In Chapter 13, 
we determine conditions that characterize the irreducibility of a 
binomial and describe the Galois group of a binomial. Chapter 14 
briefly describes the theory of families of binomials—the so-called 
Kummer theory. 

Sections marked with an asterisk are optional, in that they may be 
skipped without loss of continuity. The unmarked sections might be 
considered as forming a basic core course in field theory. 
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Chapter 0 
Preliminaries 


The purpose of this chapter is to review some basic facts that will be 
needed in the book. The discussion is not intended to be complete, nor 
are all proofs supplied. We suggest that the reader quickly skim this 
chapter (or skip it altogether) and use it as a reference if needed. 


0.1 Lattices 


Definition A partially ordered set (or poset) is a nonempty set P, 
together with a binary relation < on P satisfying the following 
properties. For all a, 6, y € P, 


1) (reflexivity) a<a 
2)  (antisymmetry) a<8,8B<a>a=8 
3) (transitivity) a<B,B<y>oa<y 


If, in addition, 
a,BEPZBSa<PorB<a 


then P is said to be totally ordered. 0 


Any subset of a poset P is also a poset under the restriction of the 
relation defined on P. A totally ordered subset of a poset is called a 
chain. If SC P and s <a for all s ES then a is called an upper bound 
for S. A least upper bound for S, denoted by lub(S) or VS, is an upper 
bound that is less than or equal to any other upper bound. Similar 
statements hold for lower bounds and greatest lower bounds, the latter 
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denoted by glb(S), or AS. A maximal element in a poset P is an 
element a € P such that a < § implies a = @. A minimal element in a 
poset P is an element y€P such that @<vy implies @= +. Zorn’s 
Lemma says that if every chain in a poset P has an upper bound in P 
then P has a maximal element. 


Definition A lattice is a poset L in which every pair of elements a, 
8 €L has a least upper bound, or join, denoted by a V f and a greatest 
lower bound, or meet, denoted by a A (. If every nonempty subset of L 
has a join and a meet then L is called a complete lattice. 0 


Note that any nonempty complete lattice has a greatest element, 
denoted by 1 and a smallest element, denoted by 0. 


Definition A sublattice of a lattice L is a subset S of L that is closed 
under meets and joins. 0 


It is important to note that a subset S of a lattice L can be a lattice 
under the same order relation and yet not be a sublattice of L. As an 
example, consider the collection f of all subgroups of a group G, 
ordered by inclusion. Then f is a subset of the power set P(G), which is 
a lattice under union and intersection. But £ is not a sublattice of P(G) 
since the union of two subgroups need not be a subgroup. On the other 
hand, ¥ is a lattice in its own right under set inclusion, where the meet 
H AK of two subgroups is their intersection and the join H VK is the 
smallest subgroup of G containing H and K. 

In a complete lattice L, joins can be defined in terms of meets: V T 
is the meet of all upper bounds of T. The fact that 1 € L insures that T 
has at least one upper bound, so that the meet is not an empty one. 
The following theorem exploits this idea to give conditions under which 
a subset of a complete lattice is itself a complete lattice. 


Theorem 0.1.1 Let L be a complete lattice. If S C L has the properties 
(i) 1 €S and (ii) TCS, T#0 => ATES, then S is a complete lattice. 


Proof. Let TCS. Then ATES by assumption. Let U be the set of all 
upper bounds of T that lie in S. Since 1€S, we have U #0. Hence, 
AU€ES and is VT. Thus, S is a complete lattice. (Note that S need 
not be a sublattice of L since AU need not equal the meet of all upper 
bounds of T in L.) 
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0.2 Groups 


Definition A binary operation on a set A is a map from AxA to A.0 


Definition A group is a nonempty set G, together with a binary 
operation on G, denoted by juxtaposition, with the following properties: 


1)  (Associativity) (af)y = a(87) for all a, 8, y EG; 

2) (Identity) There exists an element ¢ € G for which ea = ae = a for 
allaéG; 

3)  (Inverses) For each a € G, there is an element a! € G for which 
aat=ala=e. 


A group G is abelian, or commutative, if a@ = fa, for alla, 8E€G.0 


The identity element is often denoted by 1. When G is abelian, the 
group operation is often denoted by + and the identity by 0. 


Definition A subgroup S of a group G is a subset of G that is a group in 
its own right, using the restriction of the operation defined on G. We 
denote the fact that S is a subgroup of G by writing S<G.0 


Let G be a group. Since G is a subgroup of itself and since the 
intersection of subgroups of G is a subgroup of G, Theorem 0.1.1 
implies that the set of subgroups of G forms a complete lattice, where 
HAJ=HNJ and HVJ is the smallest subgroup of G containing both 
H and J. We denote this lattice by #(G). 

A group G is finite if it contains only a finite number of elements. 
The cardinality of a finite group G is called its order and is denoted by 
|G| or o(G). If a EG, and if aX = € for some integer k, we say that k 
is an exponent of a. The smallest positive exponent for a € G is called 
the order of a and is denoted by o(a). An integer m for which a™ = 1 
for all a €G is called an exponent of G. (Note: Some authors use the 
term exponent of G to refer to the smallest positive exponent of G.) 


Theorem 0.2.1 Let G be a group and let a € G. Then k is an exponent 
of a if and only if k is a multiple of o(@). Similarly, the exponents of G 
are precisely the multiples of the smallest positive exponent of G. 0 


While the smallest positive exponent of an element a€G is the 
order of the cyclic subgroup (a) = {a"|n € Z}, this does not extend to 
groups in general, that is, the smallest positive exponent of G may be 
smaller than the order of G. (Example: Z,xZ, has exponent 2 but 
order 4.) We next characterize the smallest positive exponent for finite 
abelian groups. 
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Theorem 0.2.2 Let G be a finite abelian group. 


1) If m is the maximum order of all elements in G then a™ = 1 for 
all a€G. Thus, the smallest positive exponent of G is equal to 
the maximum order of all elements of G. 

2) The smallest positive exponent of G is equal to o(G) if and only if 
G is cyclic. 


Proof. Let a have maximum order m among all the elements in G. 
Suppose that 6™ #1 for some 6 EG and let 0(f)=k<™m. It follows 
that k/m and so there exists a prime p for which p"|k but p"fm. Let 
v <u be the largest integer for which p’ | m. Consider the elements 


a’ =a” and pf! = Bk/P" 


Since o(a’) = m/p” and o(f’) = p" and since (m/p’,p") = 1, it follows 
that 
o(a’ 3") = o(a')o( 8’) = mp"-* >m 


in contradiction to the maximality of m. Thus, all elements 6 €G 
satisfy 6™ = 1. Clearly, m = o(a) is the smallest such positive integer 
and part 1) is proved. Part 2) follows easily from part 1), since a finite 
group G is cyclic if and only if it has an element of order o(G). § 


Let H <G. We may define an equivalence relation on G by saying 
that a~ @ if G~!a EH (or equivalently a! €H). The equivalence 
classes are the left cosets aH = {ah|he€H} of H in G. Thus, the 
distinct left cosets of H form a partition of G. Similarly, the distinct 
right cosets Ha form a partition of G. It is not hard to see that all 
cosets of H have the same cardinality and that there are the same 
number of left cosets of H in G as right cosets. (This is easy when G is 
finite. Otherwise, consider the map aH+Ha~1.) 


Definition The index of H in G, denoted by (G:H), is the cardinality of 
the set G/H of all distinct left cosets of H in G. If G is finite then 
(G:H) = |G|/|H|.0 


Theorem 0.2.3 Let G be a finite group. 


1) (Lagrange) The order of any subgroup of G divides the order of G. 

2) The order of any element of G divides the order of G. 

3) (Converse of Lagrange’s Theorem for Finite Abelian Groups) If A 
is a finite abelian group and if k| o(A) then A has a subgroup of 
order k. 0 
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Normal Subgroups 


Definition A subgroup H of G is normal in G, written H<G, if 
aHa-! =H for alla €G.0 


Definition A group G is simple if it has no normal subgroups other than 
{1} and G.O 


Theorem 0.2.4 The following are equivalent for a subgroup H of G. 


1) H4G. 

2) aH = Ha for all a€G. 

3) For all a €G, there exists a 8 € G such that oH = Hf. 
4) oHa! CH for all a€G. 

5) afEH=> Bo €H foralla, BEG.0 


Theorem 0.2.5 Any subgroup H of a group G of index 2 is normal. 0 


Theorem 0.2.6 If G is a group and {H;} is a collection of normal 
subgroups of G then MH; and VH; are normal subgroups of G. Hence, 
the collection of normal subgroups of G is a complete sublattice of the 
complete lattice £(G) of all subgroups of G. 0 


Theorem 0.2.7 If H <G then the set G/H of all right cosets of H in G 
forms a group under the operation (aH)(GH) =a(H if and only if 
H <G. The group G/H is called the quotient group (or factor group) of 
H in G. The order of G/H is (G:H). 0 


Euler’s Formula 


If a and @ are integers, not both zero, then an integer 6 is called a 
greatest common divisor (gcd) of a and £ if (i) 6| a and 6| @ and (ii) if 
y|a and y| @ then 7|6. Note that if 6 is a gcd of @ and £, then so is 
-6. It is customary to denote a gcd of a and £ by (a,f) or gcd(a,f). 

If (a,@)=1, then a and @ are relatively prime. The Euler phi 
function ¢ is defined by letting ¢(n) be the number of positive integers 
less than or equal to n that are relatively prime to n. The Euler phi 
function is multiplicative, that is, , 


¢(mn) = ¢(m)¢(n), when (m,n) = 1 
It also satisfies 


¢(p") = p""!(p—1), p prime, n> 0 


These two properties completely determine ¢. 
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Two integers a and (@ are congruent modulo n, written a = 8 mod n, 
if a — @ is divisible by n. Let Z,, denote the ring of integers {0,...,n—1} 
under addition and multiplication modulo n. 


Theorem 0.2.8 (Euler’s Theorem) If a, n € Z and (a@,n) = 1, then 
a®(") =1 mod n 


Proof. We first. show that the set G = {@ € Z, | (G,n) = 1} is a group of 
order ¢(n) under multiplication modulo n. Clearly, 6,7 € G imply By € 
G. Also, if @ € G, then there exists a, b € Z such that a@ + bn = 1 and 
so af = 1 mod n. Thus, a modn is the inverse of G EG. Since G is a 
group of order ¢(n), we deduce that a (") =1 mod n, for all a EG. If 
agG, then there exists an a’€G for which a’=amodn. Since 
(a,n) = 1 if and only if (a’,n) = 1, we have 


abl") = (a’)) =1modn | 


Corollary 0.2.9 (Fermat’s Theorem) If p is a prime not dividing the 
integer a, then 


aP = a mod p 1] 


Cyclic Groups 
If G is a group and a € G, then the set of all powers of a 


(a) = {a"|n € Z} 


is a subgroup of G, called the cyclic subgroup generated by a. A group 
G is cyclic if it has the form G = (a), for some a € G. In this case, we 
say that a generates G. 


Theorem 0.2.10 Every subgroup of a cyclic group is cyclic. A finite 
abelian group G is cyclic if and only if its smallest positive exponent is 
equal to o(G). 0 


The following theorem contains some key results about finite cyclic 
groups. 


Theorem 0.2.11 Let G = (a) be a cyclic group of order n. 
1) Forl<k<n, 


k,) on 
o(a*) = i,k) 
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In particular, a* generates G = (a) if and only if (n,k) = 1. 
2) Ifd|n, then 


o(a*)=d & k=r 7 where (r,d) = 1 


Thus the elements of G of order d|n are the elements of the form 


a™™/°, where 0 <r <d and r is relatively prime to d. 


3) For each d|n, the group G has exactly one subgroup H, of order 
d and ¢(d) elements of order d, all of which lie in Hg. 


Proof. To prove part 1), we first observe that if d =(k,n) then d= 
ak + bn for some integers a and b. Hence, 


at = (ak)* € (aX) 


whence (a4) C (a). But the reverse inclusion holds since d|k and so 
(a*) = (a4), Since d|n, it is clear that 


o(a*) = o(a*) = 4 — tk) 


To prove part 2), we let d|n and solve the equation 


Chea 


Rearranging gives 
n = d(n,k) = (dn,dk) 
Setting r = k/(n,k), we get dk = n[k/(n,k)] = nr and so 
n = (dn,rn) = n(d,r) 
which holds if and only if (d,r) = 1. 
For part 3), it follows from part 2) that all of the ¢(d) elements of G 
of order d lie in the subgroup Hj = (an! ). Moreover, if H is a subgroup 
of G of order d then, being cyclic, it must contain an element of order 


d. But 8 € Hg and so H = (8) = Hy. 8 


Counting the elements in a cyclic group of order n gives the following 
corollary. 


Corollary 0.2.12 For any positive integer n, 


n= > ¢(d) 0 


d|n 
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Homomorphisms 


Definition Let G and H be groups. A map ~:G-—H is called a group 
homomorphism if (a) = (ya)(~B). A surjective homomorphism is an 
epimorphism, an injective homomorphism is a monomorphism and a 
bijective homomorphism is an isomorphism. If w:G—-H is an 
isomorphism, we say that G and H are isomorphic and write G ~ H. 0 


If % is a homomorphism then we = € and val = (va)71. The kernel 
of a homomorphism ¥:G—H, 


kerp={aEG|pa=e} 


is a normal subgroup of G. Conversely, any normal subgroup H of G is 
the kernel of a homomorphism. For we may define the natural 
projection 7:G—G/H by wa=aH. This is easily seen to be an 
epimorphism with kernel H. 

Let f:S-T be a function from a set S to a set T. Let P(S) and P(T) 
be the power sets of S and T, respectively. We define the induced map 
f:P(S)9(T) by f(U) = {f(u) | u € U} and the induced inverse map by 
f-1(V) = {s €S| f(s) € V}. (It is customary to denote the induced maps 
by the same notation as the original map.) Note that f is surjective if 
and only if its induced map is surjective, and this holds if and only if 
the induced inverse map is injective. A similar statement holds with the 
words surjective and injective reversed. 


Theorem 0.2.13 Let %:G—G’ be a group homomorphism. 


1) a) IWfH<G then 7(H) <G’. 

b) If wis surjective and H«G then 4(H) 4G’. 
2) a) If H’<G' then y1(H’) <G. 

b) If H’aG’ then ~~1(H’) «G.0 


Theorem 0.2.14 (The Isomorphism Theorems) Let G be a group. 


1) (First Isomorphism Theorem) Let ¥:G—G' be a_ group 
homomorphism with kernel K. Then K«aG and the map 
v:G/K-imy defined by (aK) = pa is an isomorphism. Hence 
G/K ~ im. In particular, ~ is injective if and only if ker = 
{e}. 

2) (Second Isomorphism Theorem) If H<G and NG _ then 
NNH<H and 


H NH 


NNH” N 
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3) (Third Isomorphism Theorem) If H<I<J<G then I/H«J/H 
and I/H 


Hence (J:1) = (J/H:1/H). 0 


Theorem 0.2.15 (The Correspondence Theorem) Let H <«G and let 7 be 
the natural projection 7:G—+G/H. Thus, for any I < G, 


n(I) = 1/H = {iH |ie I} 


1) The induced maps 7 and x~! define a one-to-one correspondence 
between the lattice of subgroups of G containing H and the lattice 
of subgroups of G/H. 

2) -m preserves index, that is, for any H <I <J <G, we have 


(3:1) = (7(9):"(1)) 


3) | m@ preserves normality, that is, if H<I<J<G then I4J if and 
only if 1/H 4J/H, in which case J/I ~ m(J)/7m(1). 0 


Action of a Group on a Set 


Definition Let X be a set and let G be a group. We say that G acts on 
X if there is a function G x XX, sending (a,x) to ax € X, satisfying 


1) 1x=x for all xEX 
2)  (af)x = a(@x) for all x € X, a, BEG. 


We say that G acts transitively on X if for any x, y € X there exists an 
a €G such that ax = y. 0 


It follows from the definition that each a € G acts as a permutation 
T4:xtax of X and that the map a7, is a group homomorphism from 
G to a subgroup of the group of permutations of X. 

Definition Let G act on X. The orbit of x € X is the set 
orb(x) = Gx = {ax|a€ G} 
The stabilizer of x is the subgroup 


G, = {aE G|ax=x} 0 
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Note that G acts transitively on X if and only if orb(x) = X for all 
x € X. We may define an equivalence relation on X by setting x ~ y if 
and only if there exists an a€G for which ax =y. The equivalence 
classes are precisely the orbits in X, which therefore partition the set X. 
Since ax = (x if and only if B~!a€ G,, which in turn holds if and only 
if aG, = GG,, we deduce the existence of a bijection from G/G, onto 
orb(x). 


Theorem 0.2.16 Let G act on X. 

1) For any x€X, |orb(x)| =(G:G,) and if X is finite then 
| orb(x)| = |G|/|G,]. 

2) If G acts transitively on X then |X| =(G:G,) for any x € X and 
if X is finite then |X| = |G|/|G,|. 

3) (The class equation) 


IX| = }0(G:G,) 


where the sum is taken over one representative from each distinct 
orbit in X. 0 


Example 0.2.1 One of the most important instances of a group acting 
on a set is the case where X = G acts on itself by conjugation. To avoid 
obvious confusion, we denote the action of a € G on 8 EG by GP. Then 
as = afa-!, The orbit of 2 € G is the conjugacy class of 3 


orb(B) = {aBa-!|a EG} 
The stabilizer of @ € G is the centralizer of @ 
C(8) = {aE G| af = Ba} 


The previous theorem says that the conjugacy class of # has cardinality 
(G:C(#)). The class equation in this case is 


o(G) = }_ (G:C(8)) 


where the sum is over one representative of each conjugacy class. 

The center of G is the set Z(G) = {@ € G|af = fa for all a€ G}. 
Thus Z(G) consists of those elements of G whose centralizer is equal to 
the entire group G, or equivalently, whose conjugacy class contains only 
the element itself. In other words, 8 € Z(C) if and only if (G:C(@)) = 1. 
We may now write the class equation in the form 


o(G) = o(Z(G)) + 5_ (G:C(A)) 
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where the sum is taken over one representative from each conjugacy 
class of size greater than 1. 0 


Sylow Subgroups 


Definition If p is a prime, then a group G is called a p-group if every 
element of G has order a power of p. 0 


For finite groups, if a €G then o(a)|o(G). The converse does not 
hold in general, but we do have the following. 


Theorem 0.2.17 Let G be a finite group. 


1) (Cauchy) If o(G) is divisible by a prime p then G contains an 
element of order p. 

2) If p is a prime and o(G) is divisible by p™ then G contains a 
subgroup of order p™. 0) 


Corollary 0.2.18 A finite group G is a p-group if and only if |G| =p” 
for some n. 0 


Theorem 0.2.19 (Sylow) If G has order p"m where p/m then G has a 
subgroup of order p", called a Sylow p-subgroup of G. All Sylow p- 
subgroups are conjugate (and hence isomorphic). The number of Sylow 
p-subgroups of G divides o(G) and is congruent to 1 mod p. Any p- 
subgroup of G is contained is a Sylow p-subgroup of G. 0 


The Symmetric Group 


Definition The symmetric group S,, is the group of all permutations of 
the set A = {1,2,...,n}, under composition of maps. A transposition is 
a permutation that interchanges two distinct elements of A and leaves 
all other elements fixed. The alternating group A, is the subgroup of S,, 
consisting of all even permutations, that is, all permutations that can 
be written as a product of an even number of transpositions. 0 


-Theorem 0.2.20 

1) The order of S,, is n!. 

2) The order of A, is n!/2. Thus, [S,:A,] = 2 and A, 4S... 
3) A, is the only subgroup of S,, of index 2. 

4) A, is simple (no nontrivial normal subgroups) for n > 5. 0 
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A subgroup H of S,, is transitive if for any k, j € {1,2,...,n} there is 
ao €H for which ok =j. 


Theorem 0.2.21 If H is a transitive subgroup of S, then o(H) is a 
multiple of n. 


Proof. The group H acts on the set X = {1,2,...,n} and Theorem 0.2.16 
gives |X| = |H|/|G,|, that is, |H| =n|G,|.0 


0.3 Rings 


Definition A ring is a nonempty set R, together with two binary 
operations on R, called addition (denoted by +), and multiplication 
(denoted by juxtaposition), satisfying the following properties. 


1) Ris an abelian group under the operation +. 
2)  (Associativity) (af)y = a(G7) for all a, B, y ER. 
3) (Distributivity) For all a, 8, 7 ER, 


(a+ B)y=ay+af and 7(a+ 8) = 7a+78 q 


Definition Let R be a ring. 


1) Ris called a ring with identity if there exists an element 1 € R for 
which al = la =a, for all aE R. In a ring R with identity, an 
element a is called a unit if it has a multiplicative inverse in R, 
that is, if there exists a 6 € R such that af = Ba =1. 

2) R is called a commutative ring if multiplication is commutative, 
that is, if a@ = Ba for alla, BER. 

3) A zero divisor in a commutative ring R is a nonzero element a € 
R such that af = 0 for some 6 #0. A commutative ring R with 
identity is called an integral domain if R contains no zero divisors. 

4) A ring R with identity 10 is called a field if the nonzero 
elements of R form an abelian group under multiplication. 0 


It is not hard to see that the set of all units in a ring with identity 
forms a group under multiplication. We shall have occasion to use the 
following example. 


Example 0.3.1 Let Z, = {0,...,n-1} be the ring of integers modulo n. 
Then k is a unit in Z, if and only if (k,n) = 1. This follows from the 
fact that (k,n) = 1 if and only if there exists integers a and b such that 
ak + bn = 1, that is, if and only if ak = 1 mod n. The set of units of Z,, 
denoted by Z*, is a group under multiplication. 0 
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Definition A subring of a ring R is a nonempty subset S of R that is a 
ring in its own right, using the same operations as defined on R. 0 


Definition A subfield of a field E is a nonempty subset F of E that is a 
field in its own right, using the same operations as defined on E. In this 
case, we say that E is an extension of F and write F << E or E>F.0 


Definition Let R and S be rings. A function ~:R—S is a homomorphism 
if, for alla, BER, 


Vat B)=yat YP and (af) = (va)(vP) 


An injective homomorphism is a monomorphism or an embedding, a 
surjective homomorphism is an epimorphism and a_ bijective 
homomorphism is an isomorphism. A homomorphism from R into itself 
is an endomorphism and an isomorphism from R onto itself is an 
automorphism. 0] 


Ideals 


Definition A nonempty subset J of a ring R is called an ideal if it 
satisfies 


1) a, BE Jimpliesa-—f Ee). 
2) @ER,ce€3 implies oc €Jand we JI.0 


If S is a nonempty subset of a ring R, then the ideal generated by S 
is defined to be the smallest ideal J of R containing S. If R is a 
commutative ring with identity, and if a € R, then the ideal generated 
by {a} is the set 
(a) = Ra = {pa| p € R} 


Any ideal of the form (a) is called a principal ideal. 
Definition If ~:R—S is a homomorphism, then 


Kery = {a €R| pa = 0} 
is an ideal of R. 0 
If R is a ring and J is an ideal in R then for each a € R, we can form 


the coset ; 
a+3={ate|c€ 3} 
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It is easy to see that a+J= 6 +4 if and only if a— € 4, and that any 
two cosets a+J and 6+ are either disjoint or identical. The collection 


of all (distinct) cosets is a ring itself, with addition and multiplication 
defined by 


(a+ J)+(b+35) =(a+b)+3 
and 
(a+$)(b+ 3) =ab+3 


The ring of cosets of J is called a factor ring and is denoted by R/3. 


Definition An ideal J of a ring R is maximal if J# R and if whenever 
J3C43CR for any ideal 3, then 3 =J or $=R. An ideal J is prime if 
3#R and if a8 €Jimplies a € Jor BEIO 


It is not hard to see that a maximal ideal in a commutative ring 
with identity is prime. This also follows from the next theorem. 


Theorem 0.3.1 Let R be a commutative ring with identity and let J be 
an ideal of R. 


1) R/Jis a field if and only if J is maximal. 
2) R/Jis an integral domain if and only if 9 is prime. 0 


The Characteristic of a Ring 


Let R be a ring and let r € R. For any positive integer n, we define 


nmr=r+r+:--+r 
emma semana! 


n terms 


The characteristic char(R) of a ring R is the smallest positive integer n 
for which nl = 0 (or equivalently, nr = 0 for all r€ R), should such an 
integer exist. If it does not, we say that R has characteristic 0. If 
char(R) =0 then R contains a copy of the integers Z, in the form 
Z-1={nl|neéZ}. If char(R) =r, then R contains a copy of Z.= 
10, lok el}: 


Theorem 0.3.2 The characteristic of an integral domain is either 0 or a 
prime. In particular, a finite field has prime characteristic. 0 


If F is a field, the intersection of all of its subfields is the smallest 
subfield of F and is referred to as the prime subfield of F. 
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Theorem 0.3.3 If char(F) = 0, the prime subfield of F is isomorphic to 
the rational numbers Q. If char(F) = p is prime, the prime field of F is 
isomorphic to Zz, if 


The following result is of considerable importance for the study of 
fields of nonzero characteristic. 


Theorem 0.3.4 Let R be a commutative ring with identity of prime 
characteristic p. If q = p™ then 


(a+ A)9=0%+6%, (a—f)*=a4— 91 


Proof. Since the binomial formula holds in any commutative ring with 
identity, we have 


(a+)? = 3-(P)akge-# 
where ae 
(°) _ P(p-1)- ey 


But p if for 0<k<p, and so (j= 0 in R. The binomial formula 
therefore reduces to 


(a+ 6)P =aP +p 


Repeated use of this formula gives (a+ 6)4=a1+ 6% The second 
formula is proved similarly. 


0.4 Integral Domains 
Theorem 0.4.1 Let R be an integral domain. Let a, 6 € R. 


1) We say that a divides 3 and write a| @ if @ = pa for some pER. 
If p and a are nonunits and § = pa then a properly divides /. 
a) A unit divides every element of R. 
b) a| if and only if (8) C (a). 
c) a|{ properly if and only if (@) C (a) CR. 

2) If a=uf for some unit u then a and # are associates and we 
write a ~ f. 
a) a~ @ if and only if a| @ and #|a. 
b) a~ B if and only if (a) = (6). 

3) A nonzero element p € R is irreducible if p is not a unit and if p 
has no proper divisors. Thus, a nonunit p is irreducible if and only 
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if p = af implies either a or f is a unit. 

4) A nonzero element 7 € R is prime if 7 is not a unit and whenever 

n|af then t|a or r| @. 
a) Every prime element is irreducible. 
b) a €R is prime if and only if (7) is a nonzero prime ideal. 

5) Let a, GER. An element dER is called a greatest common 
divisor (gcd) of a and #, written (a,f) or gcd(a,), if d|a and 
d| @ and if whenever e| a, e| @ then e|d. If gcd(a,@) is a unit, we 
say that a and £ are relatively prime. 

a) The greatest common divisor of two elements, if it exists, is 
unique up to associate. 0 


Theorem 0.4.2 An integral domain R. is a field if and only if it has no 
ideals other than the zero ideal or R itself. Any nonzero homomorphism 
o:F—E of fields is a monomorphism. 0) 


Theorem 0.4.3 Every finite integral domain is a field. 0 


If R is an integral domain, we may form the set 
R’ = {a/B|a, BER, 8 £0} 


where a/@=a/b if and only if ab=af. We define addition and 
multiplication on R’ in the “obvious way” 


a, ,a_abt+fa a a_aa 

Bob Bb * fb fb 

It is easy to see that these operations are well-defined and that R’ is 
actually a field, called the field of quotients of the integral domain R. It 
is the smallest field containing R, in the sense that if F is a field and 
RCF then RC R’ CF. The following fact will prove useful. 


Theorem 0.4.4 Let R be an integral domain with field of quotients R’. 
Then any monomorphism o:R-F from R into a field F has a unique 
extension to a monomorphism @:R'—F. 


Proof. Define G(a/f) = ca/of, which makes sense since @ # 0 implies 
of #0. One can easily show that @ is well-defined. Since ca/of = 0 if 
and only if oa = 0, which in turn holds if and only if a/f =0, we see 
that @ is injective. Uniqueness is clear since |p (o restricted to R) 
uniquely determines o on R’. 
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0.5 Unique Factorization Domains 


Definition An integral domain R is a unique factorization domain (ufd) 

if 

1) Any nonunit r € R can be written as a product r = p,:--p,, where 
p; is irreducible for all i. We refer to this as the factorization 
property for R. 

2) This factorization is essentially unique in the sense that if r= 
Py" Py = 91°''G, are two factorizations into irreducible elements 
then m=n and there is some permutation 7 for which p; ~ (i) 
for all i. 0 


If rE R is not irreducible, then r=st where s and t are nonunits. 
Evidently, we may continue to factor as long as at least one factor is 
not irreducible. An integral domain R has the factorization property 
precisely when this factoring process always stops after a finite number 
of steps. 

When is an integral domain a unique factorization domain? The 
following answer helps explain the importance of ufd’s. 


Theorem 0.5.1 Let R be an integral domain for which the factorization 
property holds. The following conditions are equivalent and therefore 
imply that R is a unique factorization domain. 


1) Factorization in R is essentially unique. 

2) Every irreducible element of R is prime. 

3) Any two elements of R (not both zero) have a greatest common 
divisor. 0 


Corollary 0.5.2 In a unique factorization domain, the concepts of prime 
and irreducible are equivalent. 0 


0.6 Principal Ideal Domains 


Definition An integral domain R is called a principal ideal domain (pid) 
if every ideal of R is principal. 0 


Theorem 0.6.1 Every principal ideal domain is a unique factorization 
domain. 0 


We remark that the ring Z[x] is a ufd (as we prove in Chapter 1) but 
not a pid (the ideal (2,x) is not principal) and so the converse of the 
previous theorem is not true. 
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Theorem 0.6.2 Let R be a principal ideal domain and let J be an ideal 
of R. 


1) Jis maximal if and only if J = (p) where p is irreducible. 

2)  Jis prime if and only if 3 = {0} or J is maximal. 

3) The following are equivalent: (i) R/(p) is a field (ii) R/(p) is an 
integral domain (iii) p is irreducible (iv) p is prime. 0 © 


0.7 Euclidean Domains 


Roughly speaking, a Euclidean domain is an integral domain in 
which we can perform “division with remainder.” 


Definition An integral domain R is a Euclidean domain if there is a 
function o:(R-{0})—-N with the property that given any a, BER, 
8 #0, there exist q, r € R satisfying 

a=q6+r 


where r= 0 or or <of.0 


Theorem 0.7.1 A Euclidean domain is a principal ideal domain (and 
hence also a unique factorization domain). 


Proof. Let J be an ideal in the Euclidean domain R and let a €3 be 
minimal with respect to the value of o. Thus, ca < of for all 6 € 4. If 
s€j then 

s=ra+q 


where q=0 or oq<or. But q=s—raé€J and so the latter is not 
possible, leaving q = 0 and s € (a). Hence, J = (a). 


Theorem 0.7.2 If F is a field, then F[x] is a Euclidean domain with 
o(p(x)) = deg p(x). Hence F[x] is also a principal ideal domain and a 
unique factorization domain. 


Proof. This follows from ordinary division of polynomials; to wit, if 
f(x), g(x) € F[x], g(x) # 0, then there exist q(x), r(x) € F[x] such that 


f(x) = a(x)g(x) + r(x) 


where deg r(x) < deg g(x). & 
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0.8 Tensor Products 
Tensor products are used only in the optional Section 5.6. 


Definition Let U, V and W be vector spaces over a field F. A function 
f:U x VW is bilinear if it is linear in both variables separately, that is, 
if 

f(ru + su’,v) = rf(u,v) + sf(u’,v) 
and 

f(u,rv + sv’) = rf(u,v) + sf(u,v’) 


The set of all bilinear functions from UxV to W is denoted by 
B(U,V;W). A bilinear function f:U x V-F, with values in the base field 
F, is called a bilinear form on U x V. 0 


Example 0.8.1 

1) A real inner product (,):V x VR is a bilinear form on V x V. 

2) If A is an algebra, the product map p:AxA—A defined by 
(a,b) = ab is bilinear. 0 


We will denote the set of all linear transformations from U x V to W 
by 2£(UxV,W). There are many definitions of the tensor product. We 
choose a universal definition. 


Theorem 0.8.1 Let U and V be vector spaces over the same field F. 
There exists a unique vector space U@V and bilinear map 
tUxV—U®V with the following property. If f:UxV—W is any 
bilinear function from U x V to a vector space W over F, then there is a 
unique linear transformation r:U @ VW for which 


rot=f 0 


This theorem says that to each bilinear function f:U x VW, there 
corresponds a unique linear function r:U @ V-W, through which f can 
be factored (that is, f= 701). The vector space U @ V, whose existence 
is guaranteed by the previous theorem, is called the tensor product of U 
and V over F. We denote the image of (u,v) under the map ¢ by 
t(u,v) =u@v. 

If X= Imt={u@v|ue U, v EV} is the image of the tensor map t 
then the uniqueness statement in the theorem implies that X spans 
U®@V. Hence, every element of a € U @V is a finite sum of elements of 
the form u @v 

a=) /ai(u; vj) 


finite 
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We establish a few basic properties of the tensor product. 


Theorem 0.8.2 If {u,,...,u,}@U is linearly independent and 
{vy,---5V¥,} C V then 


dou, 8; =0> v, =0 for alli 
Proof. Consider the dual vectors 6, € U* to the vectors u;, where 6ju; = 


6; j: For linear functionals ¢;:V-—F, we define a bilinear form f:U x VF 
by 


f(u,v) = Y 4040) 
j= 


Since there exists a unique linear functional r:U@V—F for which 
tTot=f, we have 


0= {32 u,@¥,)= yore t(u;,v;) 
i i 
= PCa = a > Si(adei(vy) = ) 
i i j i 
Since the ¢,’s are arbitrary, we deduce that v; = 0 for all i. & 


Corollary 0.8.3 If u #0 and v £0, then u@v 40.0 


Theorem 0.8.4 Let B = {e; |i € I} be a basis for U and C = {f,|j € J} be 
a basis for V. Then the set D = {e, @f, |i EI, j € J} is a basis for UeV. 


Proof. To see that the 9 is linearly independent, suppose that 


dn s(& @f) = 0 
iJ 


si o(Sas=0 


Theorem 0.8.2 implies nee 


This can be written 


i= 
J 


for all i, and hence r; ; = 0 for all i and j. To see that D spans U @V, let 
i,j 


u®@v EUV. Since u = 2th and v = daily we have 


u®@v= ore; ok = S(Saeer) 
i j i 
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j i ij 
Since any vector in U®V is a finite-sum of vectors u@v, we deduce 
that D spans U@V. 4 


Corollary 0.8.5 For finite dimensional vector spaces, 
dim(U ® V) = dim(U) - dim(V) 0 


Exercises 

1. The relation of being associates in an integral domain is an 
equivalence relation. 

2. Prove that the characteristic of an integral domain is either 0 or a 
prime, and that a finite field has prime characteristic. 

3. If char(F)=0, the prime subfield of F is isomorphic to the 
rational numbers Q. If char(F) = p is prime, the prime field of F 
is isomorphic to Z.. 

4. If F < E show that E and F must have the same characteristic. 

5. Let F be a field of characteristic p. The map o:F—F defined by 
oa=a? is a homomorphism. It is called the Frobenius map. 
Show that F » FP = {aP |a € F}. What if F is a finite field? 

6. Consider the polynomial ring F[x,,x,,...] where x, = X;_,-. Show 
that the factorization process need not stop in this ring. 

7 Let R=2Z[/-5] = {a+b/-5|a,b € Z}. Show that this integral 
domain is not a unique factorization domain by showing that 6 € 
R has essentially two different factorizations in R. Show also that 
the irreducible element 2 is not prime. 

8. Let R be a pid. Then an ideal J of R, is maximal if and only if J = 
(p) where p is irreducible. Also, R/(p) is a field if and only if p is 
irreducible. 

9. Prove that (x) and (2,x) are both prime ideals in Z[x] and that (x) 
is properly contained in (2,x). 

10. Describe the divisor chain condition in terms of principal ideals. 


Part 1 
Basic Theory 


Chapter 1 
Polynomials 


In this chapter, we discuss properties of polynomials that will be 
needed in the sequel. Since we assume that the reader is familiar with 
the basic properties of polynomials, some of the present material may 
constitute a review. 


1.1 Polynomials Over a Ring 


We will be concerned in this book mainly with polynomials over a 
field F, but it is useful to make a few remarks about polynomials over a 
ring R as well. Let R[x] denote the ring of polynomials in the single 
variable x over R. If 


p(x) = aj +a,x+-+++a,x” 


where a,€R and a, #0 then n is called the degree of p(x), 
written deg p(x) and a, is called the leading coefficient of p(x). A 
polynomial is monic if its leading coefficient is 1. The degree of the zero 
polynomial is defined to be —oo. 

If R is a ring, the units of R[x] are the units of R, since no 
polynomial of positive degree can have an inverse in R[x]. 


Definition Let R be a ring. A polynomial p(x) € R[x] is irreducible over 
R if whenever p(x) = f(x)g(x) for f(x), g(x) € R[x], then one of f(x) or 
g(x) is a unit in R[x]. A polynomial that is not irreducible is said to be 
reducible. 0) 
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Many important properties that a ring R may possess carry over to 
the ring of polynomials R[x]. For instance, if R is an integral domain, 
then so is R[x] and if R is a unique factorization domain, then so is 
R[x]. Note, however, that the ring Z of integers is a principal ideal 
domain, but Z[x] is not, since the ideal (2,x) is not principal. 
Nonetheless, if F is a field, F[x] is a principal ideal domain (Theorem 
0.7.2). 


1.2 Primitive Polynomials 
We now consider polynomials over a unique factorization domain. 


The reader may wish to take a quick look at Section 0.5. 


Definition Let f(x) € R[x] where R is a unique factorization domain. 
Any greatest common divisor of the coefficients of f(x) is called a 
content of f(x). A polynomial with content 1 is said to be primitive. We 
will use the notation c(f) to denote a content of f(x). 0 


If a is a content of f(x), then @ is also a content of f(x) if and only if 
8B ~a, that is, 8 = ua, where u is a unit in R. Since 


c(ap(x)) ~ ae(p(x)) 
for all a € R, it follows that a is a content of f(x) if and only if f(x) = 
ap(x), where p(x) is primitive. 
We can also define the content of a polynomial over R’, the field of 


quotients of R. To this end, if p is a prime in R, then any nonzero 
element a € R’ has the form 


a= p'ag 
where r is an integer and p does not divide the numerator or 


denominator of a). The integer r is called the order of a at p, written 
op(a). If a = 0, we set 0,(a) = oo. It is easy to see that if ab # 0 then 


0,(ab) = o,(a) + 0,(b) 
If f(x) = Yaz! is a nonzero polynomial in R’[x], we set 
o,(f) = =A 0,,(a}) 
and if f(x) = 0, we set on(f) =o. Then a content of f(x) is defined to be 


4) =«]] pp) 
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where wu is any unit in R, and the product is taken over all primes p for 
which 0, (f) # 0. Thus, content in R’ is unique up to multiplication by a 
unit in R. 

For any a € R’, we have c(ap(x)) = uac(p(x)) where u is a unit in R 
and so a is a content of f(x) € R’[x] if and only if 


f(x) = ap(x) 


where p(x) is a primitive polynomial (and hence in R{[x]). It follows that 
f(x) € R[x] if and only if its content is in R. 
We now come to a key result concerning primitive polynomials. 


Theorem 1.2.1 Let R be a unique factorization domain and let R’ be 
the field of quotients of R. 


1) (Gauss? Lemma) If f(x) and g(x) are primitive in R[x] then so is 
f(x)g(x). 

2) ‘If f(x), g(x) € R’[x] then c(fg) = uc(f)c(g), where u is a unit in R. 

3) Let f(x), g(x) € R[x], with g(x) primitive. If f(x) = g(x)h(x), where 
h(x) € R’(x) then, in fact, h(x) € R[x]. 

Proof. To prove Gauss’ Lemma, suppose that fg is not primitive. Then 

there exists an irreducible element r € R for which r|fg. Since R is a 


unique factorization domain, r is also prime. Hence (r) is a prime ideal 
and R[x]/(r) is an integral domain. Since r | fg, we have fg € (r) and so 


(f+ (r))(g + (r)) = fg + (r) = (7) 


whence f + (r) = (r) or g + (r) = (r), that is, r|f or r|g. Hence, one of f 
or g is not primitive. 

To prove part 2), observe that if c, is a content of f(x) and c, is a 
content of g(x) then f = cg’ and g = c,g’, where f’ and g’ are prantve: 
Hence, by Gauss’ Lemma 


c(fg) = c(cge,f’g’) ~ cecgc(f’g’) = gc, 
As to part 3), we have 
e(f) ~ e(g)c(h) ~ c(h) 
and since c(f) € R, so is c(h), whence h(x) € R[x]. & 


The previous theorem can be used to relate the irreducibility of a 
polynomial over a unique factorization domain R to its irreducibility 
over the field of quotients R’ of R. The next theorem says in loose 
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terms that the only difference between irreducibility over R and over R’ 
is how constant factors are treated. 


Theorem 1.2.2 Let R be a unique factorization domain, with field of 
quotients R’. 


1) A primitive polynomial p(x) € R[x] is irreducible over R if and 
only if it is irreducible over R’. 

2) A polynomial f(x) € R[x] is irreducible over R if and only if it is 
either an irreducible element of R or a primitive polynomial that 
is also irreducible over R’. 


Proof. To prove part 1), observe that a primitive polynomial p(x) has 
no constant nonunit factors and so p(x) is irreducible over R if and only 
if it can be written as a product of nonconstant factors over R. Hence, if 
p(x) is reducible over R, it is also reducible over R’. On the other hand, 
if p(x) is reducible over R’, then it has the form p(x) = f(x)g(x), where 
f(x) and g(x) are nonconstant polynomials in R’[x]. Now we may write 


p(x) = f(x)[e(f)g(x)] 


where f'(x) is primitive and hence, by Theorem 1.2.1, c(f)g(x) is a 
polynomial over R. Thus f(x) is reducible over R as well. 

To prove part 2), note that if f(x) is a constant, then there is nothing 
to prove, since the constant nonunits in R[x] are precisely the nonunits 
in R. On the other hand, if f(x) has positive degree, then it is 
irreducible over R if and only if it is both primitive and irreducible over 
R. and this is equivalent, by part 1), to being primitive and irreducible 
over R’. i 


1.3 The Division Algorithm 


The familiar division algorithm for polynomials over a field F can be 
easily extended to polynomials over a commutative ring with identity, 
provided that we divide only by polynomials whose leading coefficient is 
a unit. We leave proof of the following to the reader. 


Theorem 1.3.1 (Division algorithm) Let R be a commutative ring with 
identity. Let g(x) € R[x] have invertible leading coefficient. Then for 
any f(x) € R[x], there exist unique q(x), r(x) € R[x] such that 


f(x) = q(x)g(x) + r(x) 


where deg r(x) < deg g(x). 0 


1 Polynomials 29 


This theorem has some very important immediate consequences. 


Corollary 1.3.2 Let R be a commutative ring with identity and let 
f(x) € R[x]. Then a@ is a root of f(x) if and only if x—a is a factor of 
f(x) over R. 0 


Since the usual degree formula deg f(x)g(x) = deg f(x) + deg g(x) 
holds when R is an integral domain, we have the following. 


Corollary 1.3.3 If R is ari integral domain then a nonzero polynomial 
f(x) € R[x] can have at most deg f(x) roots in R. 0 


In the customary way, a polynomial p(x) € R[x] can be thought of as 
a function on R. If R is an integral domain, Corollary 1.3.3 insures that 
if p(r) = 0 for an infinite number of distinct values of r€ R then p(x) 
must be the zero polynomial. Thus, if R is infinite, then p(x) is zero as 
a function if and only if it is zero as a polynomial. Note that this does 
not hold for finite fields, for instance, the nonzero polynomial p(x) = 
x?—x is the zero function on Z,. This result can be extended to 
polynomials in more than one variable by induction and we leave the 
details to the reader. 

A polynomial in more than one variable may have infinitely many 
zeros, however, and yet not be the zero polynomial. For instance 
p(x,y)=x—y has infinitely many zeros over R. This example 
notwithstanding, we do have the following useful result, which says 
informally that if a polynomial has a whole subfield worth of zeros, 
then it must be the zero polynomial. 


Theorem 1.3.4 Let F be an infinite field and let L be an extension of F. 
Suppose that q(x,,...,x,) is a polynomial over L. If q(a,,...,a,) = 0 for 
all a; € F then q(x,,...,x,) is the zero polynomial. 


Proof. Write 
q(X1)---)X,) = BES s 
: e 
where x° = Xqlee extn and A, € L. Let {;} be a basis for L as a vector 


space over F. Then 
re = ye if i 
7] 


for aye F and so 


q(X41++-)X,) = SoA = 9 Yo ae i8X° a > 8( 33,2") 
e ei i e 


If b; € F, we have 
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0 = q(bj,.. -,b,) = S A{ Do a.,b*) 


and the independence of the (;’s implies that 


> oa, jb° =0 
e€ 


for all i. Since this holds for all b; € F, the polynomial do ae ix” over F 
must be the zero polynomial. It follows that a, ; = 0 for all e and i, and 
so A, = 0 for all e, whence q(x,,...,x,) =0. 1 


Corollary 1.3.3 can be used to prove a fundamental fact concerning 
finite fields. 


Corollary 1.3.5 Let F be a finite field. The multiplicative group F* of 
all nonzero elements of F is cyclic. 


Proof. Let |F*| =q-—1 and let a have maximum order m<q-1 
among all the elements in F*. Since F* is a finite abelian group, 
Theorem 0.2.2 implies that a™ = 1 for all a € F*. Thus, every element 
of F* is a root of the polynomial x™—1, which has at most m roots. 
Hence m = q —1, and F®™ is cyclic. § 


In defining the greatest common divisor of two polynomials, it is 
customary (in order to obtain uniqueness) to require that it be monic. 


Definition Let f(x) and g(x) be polynomials over F. The greatest 
common divisor of f(x) and g(x), denoted by (f(x),g(x)) or 
gcd(f(x),g(x)), is the unique monic polynomial p(x) over F for which 


1) p(x) | f(x) and p(x) | g(x). 
2) ‘If r(x) € F[x] and r(x) | f(x) and r(x) | g(x) then r(x) | p(x). 0 


The existence of greatest common divisors and the fact that d(x) = 
gcd(f(x),g(x)) is independent of the field F, that is, d(x) lies in any field 
K containing the coefficients of f(x) and g(x), follow from the fact that 
F[x] is a principal ideal domain. In particular, the ideal I = (f(x),g(x)) 
of K[x] is principal and so I = (p(x)) where p(x) € K[x]. Since f(x) € 
(p(x)), we have p(x) | f(x) and similarly p(x) | g(x) over K and hence 
over any larger field F. Since p(x) € (f(x),g(x)), there exist a(x), b(x) € 
K[x] such that p(x) = a(x)f(x) + b(x)g(x). Hence, if q(x)|f(x) and 
q(x) | g(x) over F then q(x) | p(x) over F. Thus, p(x) = gcd(f(x),g(x)). 


Theorem 1.3.6 Let f(x), g(x) € F[x] and let K be the smallest subfield of 
F containing the coefficients of f(x) and g(x). Then there exist a(x), 
b(x) € K[x] such that gcd(f(x),g(x)) = a(x)f(x) + b(x)g(x) € K[x]. 0 
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Theorem 1.3.7 Let f(x), g(x) € F[x] and let F < E. Then f(x) and g(x) 
have a nonconstant common factor over F if and only if they have a 
nonconstant common factor over E. 


Proof. Any common divisor h(x) of f(x) and g(x) over E is also a 
divisor of a(x)f(x) + b(x)g(x) = gcd(f(x),g(x)). Hence, if h(x) is 
nonconstant, so is gcd(f(x),g(x)). 


Definition The polynomials f(x), g(x) € F[x] are relatively prime if 
gcd(f(x),g(x)) = 1. In particular, f(x) and g(x) are relatively prime if 
and only if there exist polynomials a(x), b(x) € F[x] for which 


a(x)f(x) + b(x)g(x) = 1 u 


Corollary 1.3.8 The polynomials f(x), g(x) € F[x] are relatively prime if 
and only if they have no common roots in any extension field E of F. 


Proof. If gcd(f(x),g(x)) =1 then a(x)f(x) + b(x)g(x) =1 implies that 
f(x) and g(x) have no common roots in any extension. Conversely, if 
gcd(f(x),g(x)) is nonconstant, any of its roots is a common root of f(x) 
and g(x) in some extension. §f 


Corollary 1.3.9 If f(x) and g(x) are distinct monic irreducible 
polynomials over F then they have no common roots in any extension E 
of F.0 


1.4 Splitting Fields 


It is a fundamental fact that every nonconstant polynomial 
f(x) € F[x] has a root in some field. 


Theorem 1.4.1 Let F be a field, and let f(x) € F[x] be a nonconstant 
polynomial. Then there exists an extension E of F and an a € E such 
that f(a) = 0. 

Proof. We may assume that f(x) is irreducible. Consider the field E = 
F[x]/(f(x)). The field F may be thought of as a subfield of E, by 
identifying a € F with a+(f(x)) € E. Then x + (f(x)) is a root of f(x) in 
E. (We have actually shown that F can be embedded in a field in which 


f(x) has a root, but this is sufficient in view of Exercise 17 of Chapter 
2.) 


Repeated application of Theorem 1.4.1 gives the following corollary. 
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Corollary 1.4.2 Let f(x) € F[x]. There exists an extension E of F such 
that f(x) factors into linear factors over E. 0 


If a polynomial f(x) € F[x] factors into linear factors 


f(x) = a(x — a,)(x — aq):+-(x-—@,) 


over an extension field E (that is, if a, a,,...,a, € E), we say that f(x) 
splits in E. 


Definition Let ¥ = {f,(x)} be family of polynomials over a field F. A 
splitting field for F is an extension field E of F with the property that 
each f;(x) in ¥ splits in E and that E is generated over F by the roots of 
the polynomials in F. 0 


Corollary 1.4.3 Every finite family of polynomials over a field F has a 
splitting field. 


Proof. Corollary 1.4.2 implies that any single polynomial has a splitting 
field. If ¥ is a finite family of polynomials, then a splitting field for ¥ is 
a splitting field for the product of the polynomials in F. § 


We will see in the next chapter that any family of polynomials has a 
splitting field. We will also see that any two splitting fields S, and S, 
for a family of polynomials over F are isomorphic by an isomorphism 
that fixes each element of the base field F. 


1.5 The Minimal Polynomial 


Let F < E. An element a € E is said to be algebraic over F if there is 
some nonzero polynomial f(x) € F[x] for which f(a~) =0. An element 
that is not algebraic over F is said to be transcendental over F. 

If a is algebraic over F, the set 


d= {g(x) € F[x] | g(a) = 0} 


is a nonzero ideal in F[x] and is therefore generated by a unique monic 
polynomial p(x), called the minimal polynomial of a over F and 
denoted by min(a,F). The following theorem characterizes minimal 
polynomials in a variety of useful ways. Proof is left to the reader. 


Theorem 1.5.1 Let F <E and let p(x) = min(a,F) where a € E. Then 
among all polynomials in F[x], the polynomial p(x) is 
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1) the unique monic irreducible polynomial for which p(a) = 

2) the unique monic polynomial of smallest degree for which 
p(a) = 0 

3) the unique monic polynomial with the property that f(a) = 0 if 
and only if p(x) | f(x). 0 


Definition Let F < E. Then a, 8 € E are said to be conjugates over F if 
they have the same minimal polynomial over F. 0 


1.6 Multiple Roots 


Definition Let a be a root of f(x) € F[x]. The multiplicity of a is the 
largest positive integer n for which (x—a)” divides f(x). If n=1, we 
say that a is a simple root and if n > 1, we say that a is a multiple root 
of f(x). 0 


Definition An irreducible polynomial f(x) € F[x] is said to be separable 
if it has no multiple roots in any extension of F. An irreducible 
polynomial that is not separable is inseparable. ] 


Although, as we now show, all irreducible polynomials over a field of 
characteristic zero or a finite field are separable, the concept of 
separability plays a key role in the theory of more “unusual” fields. 


Theorem 1.6.1 A polynomial f(x) has no multiple roots if and only if 
f(x) and its derivative f(x) are relatively prime. 


Proof. Over a splitting field E for f(x), we have 
A(x) = (x= 04)"2 (= 4)" 


where the a;,’s are distinct. It is easy to see that f(x) and f'(x) have no 
nontrivial common factors over E if and only if e; = 1 for all i. Thus, 
f(x) has no multiple roots in E if and only if f(x) and f’(x) are relatively 
prime. ff 


Corollary 1.6.2 An irreducible polynomial f(x) is separable if and only if 
‘f'(x) #0. 

Proof. Since deg f(x) < deg f(x) and f(x) is irreducible, we deduce that 
f(x) and f’(x) are relatively prime if and only if f(x) #0. & 


If char(F) = 0 then f'(x) #0 for any nonconstant f(x). ‘Thus, we get 
the following corollary. 
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Corollary 1.6.3 All irreducible polynomials over a field of characteristic 
0 are separable. 0 


For char(F)=p 0, the next result says that the inseparable 
polynomials are precisely the polynomials of the form g(x’) for some 
d>1. 


Corollary 1.6.4 Let char(F) = p # 0 and let f(x) € F[x] be irreducible. 


1) If f(x) is inseparable, then there exists a positive integer d such 
that f(x) = q(x’), where q(x) is separable. In this case, all roots of 
f(x) have multiplicity p4. 
2) ‘If f(x) = h(x?) where h(x) is any nonconstant polynomial and d is 
a positive integer, then f(x) is inseparable. 
Proof. For the first statement in part 1), suppose that f(x) = Dax! has 
a multiple root in some extension E of F. Then f(x) = 0 which implies 
that ia; = 0 for all i, which in turn implies that p|i for all i such that 
a; # 0. Hence, f(x) = q(x?). If q(x) has no multiple roots, we are done. 
If not, then we may repeat the argument with the irreducible 
polynomial q(x), eventually obtaining the desired result. 
For part 2), if h(x) is not separable, then by part 1), we have h(x) = 
q(x?) where q(x) is separable and so 


f(x) = h(x) = q(x?) 


Thus, we may suppose that h(x) is separable. Let K be a field in which 
both f(x) and h(x) split. Over K, we have h(x) = (x — @)-+-(x — a) and 


so 


f(x) = (x — a4)-+-(x — ay) 


where the a; € K are distinct. Since f(x) splits in K, there exist roots 
A; € K for each of the factors x° —a,, and so a, = BF. Hence, 


f(x) = (x? — pe). -(xF* — af) 
Since char(F) = p, 
f(x) = (x — By)P+-(x— By)” 


which shows that all the roots of f(x) have multiplicity p?. This proves 
part 2) and also the second statement in part 1). & 


Corollary 1.6.5 All irreducible polynomials over a finite field are 
separable. 
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Proof. Let char(F) = p. The field F is an extension of its prime subfield 
Z, and if the dimension of F as a vector space over Z, is n, then F has 
q=p" elements. Hence, the multiplicative group F* of nonzero 
elements of F has order q—1 and so a4 =a for all a F. In particular, 
any element of F is a p-th power of some other element of F. Thus, any 


polynomial of the form q(x?) satisfies 
G(x?) = ap + ayxP +++ +b a,xP” 
= bp + bPxP +--+ BPxtP 
= (by + byx +--+ +b, x")P 
and so is not irreducible. § 


We should note that in infinite fields of nonzero characteristic, there 
are irreducible polynomials with multiple roots. 


Example 1.6.1 Let F be a field of characteristic 2 and consider the field 
F(y) of all rational functions in the variable y. The polynomial f(x) = 
x? —y? is irreducible over the subfield F(y?), since it has no linear 
factors over F(y”). However, in F(y) we have f(x) = (x—y)? and so y is 
a double root of f(x). 0 


1.7 Testing for Irreducibility 
We discuss two well-known methods for testing a polynomial for 


irreducibility. 


Theorem 1.7.1 (Eisenstein’s criterion) Let R be an integral domain and 
let p(x) = ag +a,x +-+++ a,x” € R[x] have relatively prime coefficients. 
If there exists a prime p € R satisfying 


p|a; for 0 <i<n, pfa,, p? {ag 


then p(x) is irreducible. 


Proof. Suppose that p(x) = f(x)g(x) where neither factor is a unit. If 
f(x) =f) € R then fy divides a, for all i, implying that f(x) =fp is a 
unit, which is not the case. Thus, degf(x)>0 and _ similarly 
deg g(x) > 0. Let 


f(x) =fyttpxte+fx* and g(x) = By +eyx+-°++8,,x™ 


Since a) = fogq and p| ag, p*} ag we may assume that p|fy and p}'gp. 
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Let 0 <i<n be the smallest integer for which p/f, and consider the 
coefficient 


a = fog; + fgi_1 +++: + figo 


We have p|a;, p| fog;,..-,f,_18, but p/f,go, a contradiction. Hence p(x) 
is irreducible. &f 


Eisenstein’s criterion can be useful as a theoretical tool. 


Corollary 1.7.2 For every positive integer n, there is an irreducible 
polynomial p, (x) of degree n over the integers. 0) 


A useful approach to testing for irreducibility over Z[x], and hence 
also over Q[x], is localization. For a prime p, let o:Z—Z,, be the natural 
map 

on =n =n + (p) 


If p(x) € Z[x] we denote (op)(x) by p(x). 


Theorem 1.7.3 Let p(x) = ag + a,x +---+ a,x" € Z[x] be primitive. Let 
p be a prime that does not divide a,. If p(x) is irreducible over Z, then 
p(x) is irreducible over Z. 


Proof. Assume that p(x) is irreducible over Z, but that p(x) = f(x)g(x) 
is the product of nonunits over Z. Then p(x) = f(x)g(x). Since a, #0 
mod p, we have 


deg f(x) + deg &(x) = deg p(x) = deg p(x) = deg f(x) + deg g(x) 


which implies that deg f(x) = deg f(x) and deg g(x) = deg g(x). Since 
P(x) is irreducible, we must have deg f(x) = 0 or deg g(x) = 0, implying 
that one of f(x) or g(x) is a constant (nonunit), in contradiction to the 
primitiveness of p(x). Hence, p(x) is irreducible over Z. § 


Exercises 

1. Prove that if R is an integral domain then so is R[x,,...,x,]- 

2. Describe the units in F[x] where F is a field. 

3. Let R be an integral domain. Prove that c(ap(x)) ~ ac(p(x)) for 
any p(x) € R[x] and aE R. 

4. Prove that if n>1 then the ring F[x,,...,x,] is not a principal 
ideal domain. 

5. — If f(x) € R[x] where R is an integral domain with field of quotients 
R’, then f(x) can also be viewed as a polynomial in R'[x]. Show 
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that the definition of content for f(x) € R[x] agrees with the 
definition of content for f(x) € R’[x]. 

Verify the division algorithm (Theorem 1.3.1) for commutative 
rings with identity. Hint: try induction on deg f(x). 

Show that the condition that p(x) be primitive is essential in the 
first part of Theorem 1.2.2. 

Prove Theorem 1.5.1. 

Let deg p(x) =d. The reciprocal polynomial is q(x) = x%p(x7?). 
Are the irreducibility of p(x) and q(x) related? Can you deduce an 
alternate version of Eisenstein’s criterion from this? 

Show that if p is a prime in an integral domain R, the polynomial 
p(x) = x" — p is irreducible. 

Prove that for every positive integer n there is an irreducible 
polynomial p,(x) € Z[x] of degree n. 

For p prime show that p(x) = 1+x+x?+---4+xP7! is irreducible 
over Z[x]. Hint: apply Eisenstein to the polynomial p(x+1). 

Use the idea of localization (apply the map o) to deduce that 
Eisenstein’s criterion implies irreducibility in Z[x]. 

Prove that for p prime, x" + px + p? is irreducible over Z[x]. 

If R is an infinite integral domain and p(x,,...,x,) is a 
polynomial in several variables over R, show that p(x,,...,x,) is 
zero as a function if and only if it is zero as a polynomial. 


If f(x) is a polynomial of degree d, we define the reciprocal polynomial 
by fp(x) = x4f(x!), Thus, if 


then 


f(x) = a,x™+a,_yx™ 1 +---+a,x +a, 


f(x) = agx™ +ayxP petal ixta, 


If a polynomial satisfies f(x) = fy(x), we say that f(x) is self-reciprocal. 


16. 


17. 


18. 


19. 


20. 


Show that a £0 is a root of f(x) if and only if a? is a root of 
f(x). 

Show that the reciprocal of an irreducible polynomial f(x) # x is 
also irreducible. 

Show that if a polynomial f(x) is self-reciprocal and irreducible, 
then deg f(x) must be even. 

Suppose that f(x) = p(x)q(x), where p(x) and q(x) are irreducible, 
and f(x) is self-reciprocal. Show that either 

(i) p(x) = 6pp(x) and q(x) = bap (x) with 6= +1, or 

(ii) p(x) = agp(x) and q(x) = a “pp(x) for some a € GF(q). 
What can you say about this if deg p(x) is odd? 

There is a simple (but not necessarily practical) algorithm for 
factoring any polynomial over Q, due to Kronecker. In view of 
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Theorem 1.2.2, it suffices to consider polynomials with integer 
coefficients. Prove that a polynomial of degree n is completely 
determined by specifying n+1 of its values. Hint: Use the 
Lagrange Interpolation Formula 


p(x) = 000 | =| 
Tl 


i=0 j 


Let f(x) be a polynomial of degree n>1 over Z. If f(x) has a 
nonconstant factor p(x) of degree at most n/2, what can you say 
about the values p(i) for i=0,...,[n/2]? Construct an algorithm 
for factoring f(x) into irreducible factors. 


Chapter 2 
Field Extensions 


Field extensions F < E can be characterized in a variety of useful 
ways. Some characterizations involve properties of the individual 
elements of the extension. For instance, an extension F < E is algebraic 
if each element a€E is algebraic over F. Other characterizations 
involve the field E as a whole. For instance, F < E is normal if E is the 
splitting field for a family of polynomials over F. In this chapter, we 
will describe several types of extensions and study their basic properties. 


2.1 The Lattice of Subfields of a Field 


If E is an extension field of F, then E can be viewed as a vector space 
over F. The dimension of E over F is denoted by [E:F] and called the 
degree of E over F. A sequence of fields Ej,...,E,, for which E; < E; 41 
is referred to as a tower of fields, and we write E, < E, <--:< E,. The 
fact that dimension is multiplicative over towers is fundamental. 


Theorem 2.1.1 Let F < K < E. Then 


[E:F] = [E:K][K:F] 
Moreover, if A = {a;|i €I} is a basis for E over K and B = {f; |j-€ J} 
is a basis for K over F, then the set C = {a;8; ji€ljeJ}isa pais for 
E over F. 
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Proof. For the independence of C, if }/ a; ;a,8; = 0 then )a; ja; = 0 for 
all j, and the latter implies that a; j=? for all i, j. Hence, C is 
independent. Next, if y€E then there exist a;€K such that y= 
>- a,q;. Since each a; is a linear combination of the B;’s, it follows that 
7 is a linear combination of the products a;(;. Hence C is a basis for E 


over F. 8 


If F and E are subfields of a field K, then the intersection FN E is 
clearly a field. The composite FE of F and E is defined to be the 
smallest subfield of K containing both F and E. The composite FE is 
also equal to the intersection of all subfields of K containing E and F. 
More generally, the composite VE; of a family § = {E, |i € I} of fields, 
all of which are contained in a single field E, is the smallest subfield of 
E containing all members of the family. Note that the composite of 
fields is defined only when the fields are all contained in one larger field. 
Whenever we form a composite, it is with the tacit understanding that 
the relevant fields are so contained. 

A monomial over a family § = {E,|i € I} of fields with E; < E is an 
element of E of the form 

& 8, : ca where e, E€ Ei. 
Note that the set of all finite sums of monomials over & is the smallest 
subring R of E containing each field E; and the set of all quotients of 
elements of R (the quotient field of R) is the composite VE,. Thus, 
each element of V E, involves only a finite number of elements from the 
union UE; and is therefore contained in a composite of a finite number 
of fields from the family &. 

The collection of all subfields of a field K forms a complete lattice £ 
(under set inclusion), with meet being intersection and join being 
composite. The zero element in £ is the prime subfield of K and the 
unit element is K itself. 


2.2 Distinguished Extensions 


Following Lang, we will say that a class © of field extensions is 
distinguished provided that 


Dl) If F<K<E, then (F<E)€€ if and only if (F<K)€€ and 
(K <E)€€. 


D2) If (F <E) €C and F < K and EX is defined, then (K < EK) €C. 
Note that if C is distinguished, then 
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D3) If (F<E)E€C and (F<K)EC and EK is defined, then 
(F < EK) EC. In other words, € is closed under taking (a finite 
number of) composites. 


Figure 2.2.1 illustrates D1) and D2). We refer to K < EK as the 
lifting of the extension F < E by K. 


E EK 

K E K 

F F 
Figure 2.2.1 


If a class C of extensions has the property that whenever (F < E,) EC 
for each member of a family {E;} of fields and if VE; is defined, then 
(F < VE;) EC, we say that C is closed under the taking of arbitrary 
composites. 


2.3 Finitely Generated Extensions 


If S is a subset of a field E and if F < E, we denote the smallest 
subfield of E containing F and S by F(S). When S = {a,,...,a,} is a 
finite set, it is customary to write F(a,,...,a,) for F(S). Note that for 
1<k<n-l, 


F (04). - 65%) = [F(0q5.- +, MH) (Og ase ++ Mp) 


Definition Any field of the form E = F(q,,...,a,) is said to be finitely 
generated over F. We also say that the extension F < E is finitely 
generated. Any extension of the form F< F(a) is called a simple 
extension and a is a primitive element in F(a). 0 


The reader may have encountered a different meaning of the term 
primitive in connection with elements of a finite field. We will discuss 
this alternate meaning when we discuss finite fields later in the book. 

It is evident that F(qa,,...,a,) consists of all quotients of 
polynomials in the a,’s: 


F(ay,...,%,) af Navies ta) 


B(O4,-+-)Q,) 


| e(ays..-10,) #0} 
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Theorem 2.3.1 The class of all finitely generated extensions is 
distinguished. 


Proof. For D1, let F< K < FE. If E=K(S) and K = F(T) where S and 
T are finite, then E = F(SUT) is finitely generated over F. Clearly, if 
F <E is finitely generated then K < E is also finitely generated by the 
same set of generators. However, the proof that F <K is finitely 
generated must be postponed until we have discussed transcendental 
extensions in the next chapter. Statement D2 follows from the fact that 
if E = F(S), S finite, and F < K then 


KE = K(F(S)) = K(S) 


and so KE is finitely generated over K. 


2.4 Simple Extensions 


Since F[x] is a principal ideal domain, the ideal (p(x)) generated by 
p(x) € F[x] is maximal, and the quotient ring 


_ Fix) 
K=O) - 


is a field, if and only if p(x) is irreducible. We can use this observation 
to characterize simple algebraic extensions. 


Theorem 2.4.1 Let F < E and let a € E be algebraic over F. Then F(a) 
is isomorphic to the field 

___ Fix 

~ (min(a,F)) 


Proof. Let y:F[x]—E be the evaluation (ring) homomorphism defined 
by +(f(x)) = f(a). The kernel of ~ is the ideal (min(a,F)), and so K is 
isomorphic to ¥(F[x]), which implies that ¥(F[x]) is a field. Thus, we 
need only show that 7(F[x]) = F(a). Clearly, ¥(F[x]) C F(a). But 
a = (x) € Y(F[x]) and F C ¥(F[x]) imply that F(a) C ¥(F[x]). Hence, 
¥(F[x]) = F(a). 8 


K 


Let p(x) be irreducible over F. Since addition and multiplication in 
K = F[x]/(p(x)) is done using coset representatives and since 


K' = {f(x) € F[x] | deg f(x) < deg p(x)} 


is a complete set of distinct coset representatives for K, we may identify 
K with K’, where addition and multiplication are performed modulo 
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p(x). This allows us the customary practice of thinking of F as a 
subfield of K. Note also that, as a vector space over F, we have 
dim K = deg p(x). In the symbolism of Theorem 2.4.1, we have 
[F(a@):F] = deg min(a,F). 

Thus F(a) is the set of all polynomials in a of degree less than d = 
deg min(a,F), with addition and multiplication modulo min(a,F). It 
follows that the set {1,a,...,a¢~1} is a basis for F(a) over F. 

As for simple transcendental extensions, we have the following. 


Theorem 2.4.2 Let F < E and let a € E be transcendental over F. Then 
F(a) is isomorphic to the field of all rational functions F(x) in a single 
variable x. 


Proof. The evaluation homomorphism y:F(x)-E is injective, for if 
f(a)/g(a) = 0 then f(a) = 0, which implies that f(x) = 0, since otherwise 
a would be algebraic. Since ¥(F(x)) = F(a), we deduce that y is an 
isomorphism from F(x) onto F(a). 


2.5 Finite Extensions 


If F < E and [E:F] is finite, we say that E is a finite extension of F or 
that F < E is finite. We have already seen that the following is true. 


Theorem 2.5.1 If F < E and if a € E is algebraic over F then F < F(a) 
is finite, and [F(a):F] = deg min(a,F). 0 


Theorem 2.5.2 An extension is finite if and only if it is finitely 
generated by algebraic elements. 


Proof. If F < E is finite and if {a,,...,a,,} is a basis for E over F, then 
E = F(qj,...,@,) is finitely generated over F. Moreover, for each k, the 
infinite set of nonnegative powers of a, cannot be linearly independent 
over F, it follows that a, must be algebraic over F. 

For the converse, assume that E = F(aj,...,0,), where each a; is 
algebraic over F, and consider the tower 


F < F(a,) < F(ay,a9) < +++ < F(ay,...,@,) =E 


Since a; is algebraic over F(a,,...,0;_,), each extension in the tower is 
finite, and so E is finite over F by Theorem 2.1.1. § 


Suppose that E = F(qj,...,a,) is finitely generated by algebraic 
elements a; over F and consider the tower 
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F < F(a,) < F(a 1,09) <+++ < F(a,,...,a,,) =E 


Our results on simple algebraic extensions show that any element of 
F(a,) is a polynomial in a, over F. Further, any element of F(a,,a) is 
a polynomial in a, over F(a,), and hence a polynomial in the two 
variables a, and a. Continuing in this way, we conclude that E is the 
set of all polynomials over F in aj,...,a,. 


Theorem 2.5.3 The class of finite extensions is distinguished. 


Proof. The multiplicativity of degree shows that D1 holds. For D2, let 
F<E be finite, with basis {a,,...,a,} and let F<K. Thus E= 
F(q,,...,4,,) where each a; is algebraic over F and so also over K. Since 
EK = K(q,,...,@,) is finitely generated by elements algebraic over K, it 
is a finite extension of K. § 


Note that if E is a splitting field for p(x) € F[x] then E is generated 
by a complete set of distinct roots a,,...,a, of p(x). Thus E= 
F(a,,...,@,) is finitely generated by algebraic elements and so is a 
finite extension of F, of degree at most d!, where d = deg p(x). This also 
applies to the splitting field for any finite set of polynomials over F. 

Suppose that F < E is finite and let B = {8,,...,8,,} be a basis for E 
over F. If F <K, then since EK = K((,...,@,) and each {; is algebraic 
over F, and hence also over K, it follows that EK is the set of 
polynomials over K in ,,...,8,. However, any monomial in the §,’s is 
a linear combination (over F) of (j,...,8, and so EK is the set of 
linear combinations of (,,...,, over K. In other words, B spans EK 
over K. We have proved the following, which says that a lifting cannot 
increase degree. 


Theorem 2.5.4 If B is a basis for E over F and if F < K then B spans 
EK over K. In particular, if F < E is finite then [EK:K] < (E:F]. 0 


The next theorem characterizes finite simple extensions. 


Theorem 2.5.5 A finite extension F<E has the form E= F(a) for 
a € E if and only if there are only a finite number of intermediate fields 
F < K < E between E and F. 


Proof. Suppose first that E = F(a), and that p(x) = min(a,F). Define a 
map 7 that assigns to each intermediate field K the polynomial ¥(K) = 
min(a,K). Since p(x) € K[x] and p(a) = 0, we have #(K) | p(x). But a 
monic polynomial has only a finite number of monic divisors. Hence, 
the range of y is finite and therefore it is sufficient to show that 7 is 
injective. Let K be an intermediate field, let S be the set of coefficients 
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of ~(K) and consider the tower F(S) < K < F(a). Since ¥(K) is a monic 
irreducible polynomial over F(S) and is satisfied by a, we have 7(K) = 
min(a,F(S)). Hence, [F(a):K] = deg ¥(K) = [F(a):F(S)], which implies 
that [K:F(S)] =1, that is, K =F(S). This shows that K is uniquely 
determined by the polynomial ¥(K), and so 7 is injective. 

For the converse, if E is a finite field, the multiplicative group E* of 
nonzero elements of E is cyclic. If a generates this group, then E = F(a) 
is simple. Now suppose that E is an infinite field and there are only 
finitely many intermediate fields between E and F. Let a, 6 € E and 
consider the intermediate fields F(a+aG), for ae F. By hypothesis, 
F(a +a) = F(a+bf) for some a# be F. Hence, a+bf € F(a+af), 
implying that 


B = {la + af) — (a+ bB)] € F(a +f) 


and 
a =(a+afZ)—af € F(a+af) 


Hence, F(a,8) C F(a+af). The reverse inclusion is evident and so 
F(a,8) = F(a +a), showing that any extension of F generated by two 
elements is a simple extension. Since F < E is finite, it is finitely 
generated and an inductive argument can be used to show that F < E is 
simple. lf 


2.6 Algebraic Extensions 


Definition An extension E of F is algebraic over F (or F< E is 
algebraic) if every element a € E is algebraic over F, Otherwise, E is a 
transcendental extension of F. 0 


Theorem 2.6.1 A finite extension is algebraic. 


Proof. If F < E is finite and a € E then the sequence of powers 1, a, 
a*,... cannot be linearly independent over F and therefore some 
nontrivial polynomial in a must equal 0, implying that a is algebraic 
over F. 8 


Corollary 2.6.2 Any extension that is finitely generated by algebraic 
elements is algebraic. 0) 


Theorem 2.6.3 Let F <E. The set K of all elements of E that are 
algebraic over F is a field, called the algebraic closure of F in E. 


Proof. Let a, @ € K. The field F(a,() is finitely generated over F by 
algebraic elements and so is algebraic over F, that is, F(a,@) C K. This 
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implies that a~!, w+ and af all lie in K, and so K is a subfield of 
E.t 


Theorem 2.6.4 The class of algebraic extensions is distinguished. It is 
also closed under the taking of arbitrary composites. 


Proof. For D1, let F << K < E. It is clear that if F < E is algebraic then 
so is F < K. Also, since any polynomial over F is a polynomial over K, 
K <E is also algebraic. Conversely, suppose that F < K and K < E are 
algebraic and let a € E have minimal polynomial p(x) = 5>a,x' over K. 
Consider the tower of fields 


F < F(a,,...,a,) < F(ay,.--,a),@) 


Since @ is algebraic over F(a,,...,a,) and each a;, being in K, is 
algebraic over F, we deduce that each step in the tower is finite and so 
F < F(a,,...,4,,@) is finite. Hence, @ is algebraic over F. 

For D2, let F<E be algebraic and let F< K, with E and K 
contained in a field L. We must show that K < EK is algebraic. Let A 
be the algebraic closure of K in EK. Certainly K < A < EK. Since each 
element of E is algebraic over F it is a fortiori algebraic over K and so 
E <A. Hence, EK < A < EK, showing that EK = A is algebraic over K. 

Finally, if {E,;} is a family of fields, each algebraic over F, then so is 
V E;, since an element of V E; is also an element of a composite of only 
a finite number of members of the family. § 


The algebraic closure of the rational numbers Q in the real numbers 
R is the field A of algebraic numbers. We saw in the previous chapter 
that there is an irreducible polynomial p,(x) € Z[x] of every positive 
degree n. Hence, A is an infinite algebraic extension of Q, showing that 
the converse of Theorem 2.6.1 does not hold. 

We note finally that if F < E is algebraic and if E = F(S) for some 
S CE then each element of E is a polynomial in finitely many elements 
from S. This follows from the fact that each a € F(S) is a rational 
function in finitely many elements of S and so there exists a finite 
subset Sy CS such that a € F(S9). Hence, our discussion in Section 2.5 
related to finitely generated algebraic extensions applies here. 


2.7 Algebraic Closures 


Definition A field E is said to be algebraically closed if any nonconstant 
polynomial with coefficients in E splits in E. 0 
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Theorem 2.7.1 Let F be a field. Then there is an extension E of F that 
is algebraically closed. 


Proof. The following proof is due to Emil Artin. The first step is to 
construct an extension field F, of F, with the property that all 
nonconstant polynomials in F[x] have a root in F,. To this end, for 
each nonconstant polynomial p(x) € F[x], we let X, be an independent 
variable and consider the ring & of all polynomials in the variables X 
over the field F. Let 3 be the ideal generated by the polynomials p(X,). 
We contend that Jj is not the entire ring &. For if it were, then there 
would exist polynomials q,,...,q, € ®& and p,,...,p, € J such that 


qP1(X,, ) haa 3 4nPa(Xp_) =1 


This is an algebraic expression over F in a finite number of independent 
variables. But there is an extension field E of F in which each of the 
polynomials p,(x),...,p,(x) has a root, say aj,...,@,,. Setting X= a 
and setting any other variables appearing in the equation above equal 
to 0 gives 0 = 1. This contradiction implies that J # R. 

Since 14 ®, there exists a maximal ideal 3 such that JCj3CR. 
Then F, = ®/} is a field in which each polynomial p(x) € F[x] has a 
root, namely X,+ 9. (We may think of F, as an extension of F by 
identifying a € F with a + 3.) 

Using the same technique, we may define a tower of field extensions 


F<F,<Fy<::: 


such that each nonconstant polynomial p(x) € F;[x] has a root in F;,j. 
The union E= UF; is an extension field of F. Moreover, any 
polynomial p(x) € E[x] has all of its coefficients in F; for some i and so 
has a root in F;,,, hence in E. It follows that every polynomial p(x) € 
E[x] splits over E. Hence E is algebraically closed. § 


Definition Let F < E. Then E is an algebraic closure of F if F < E is 
algebraic and E is algebraically closed. We will denote an algebraic 
closure of a field F by F.0 


Theorem 2.7.2 Let F < E. The following are equivalent. 


1) Eis an algebraic closure of F. 

2) F<E is algebraic and any nonconstant polynomial p(x) over F 
splits in E. 

3) Eis a maximal algebraic extension of F, that is, F < E is algebraic 
and if E < K is algebraic then K = E. 
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Proof. Clearly 1) implies 2). Suppose 2) holds and E < K is algebraic. 
Let a€K. Then F<E<E(q) is an algebraic tower and so a is 
algebraic over. F. But the minimal polynomial min(a,F) splits in E and 
so a€ E. Thus K = E and 8) holds. Finally, suppose 3) holds and let 
p(x) € E[x]. Any splitting field K for p(x) is algebraic over E and so 
must equal E, which implies that p(x) splits in E. Hence, 1) holds. & 


We can now easily establish the existence of algebraic closures. 


Theorem 2.7.3 Let F < A <E where A is the algebraic closure of F in 
E. If E is algebraically closed then A is also algebraically closed and 
hence is an algebraic closure of F. Thus, any field has an algebraic 
closure. 


Proof. We have already seen that A is an algebraic extension of F. By 
hypothesis, any p(x) € A[x] splits in E and so all of its roots lie in E. 
Since these roots are algebraic over A, they are also algebraic over F 
and thus lie in A. Hence p(x) splits in A and so A is algebraically 
closed. The final statement follows from Theorem 2.7.1. 8 


2.8 Embeddings 


Homomorphisms between fields play a key role in the theory. Since a 
field F has no ideals other than {0} and F, it follows that any nonzero 
(ring) homomorphism o:F-L from F into L must be a monomorphism. 
If f:A—B is any function and if C C A, we denote the restriction of f to 
C by fl ¢. 


Definition Let F and L be fields. A monomorphism o:F-L is called an 
embedding of F into L. We will denote the image of F under o by oF 
or F®. If F < E, an embedding r:E-L for which r |; =¢ is called an 
extension of o to E. An embedding of E that extends the identity map 
uF—F is called an embedding over F, or an F-embedding. We will 
denote the set of all embeddings of E into L over F by Hom,(E,L). If 
p(x) = Dax! € F[x] and if o:F-—L is an embedding we denote the 
polynomial }° o(a, )x! by (op)(x) or p%(x). 0 


Lemma 2.8.1 

1) Let o:F—L be an embedding of F into L and let p(x) € F[x]. Then 
a €F is a root of p(x) if and only if oa is a root of p%(x). 

2) If o:K-L is an embedding of K into L and if {E,|i€I} is a 
family of subfields of K then o( VE;) = VoF. 

3) If o:K—L is an embedding of K into L and if F<K and 
Q,,.-.,@, € K then 
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o(F(a4,...,Q,)) = F7(cay,...,00,) 


Proof. Part 1) follows from the fact that o(p(a)) = p(oa). For part 2), 
since ¢ is injective, it preserves intersections. But 


VE,=(){H|E,<H<K forallie}} 
and so 


o( VE;) = { |{oH | E, <H <K for all ie ]} 


Since o:K—cK is an isomorphism, every H’ satisfying cE; < H’ < oK is 
of the form oH for some H satisfying E; < H < K and so 


o(VE,) = { ){H'| cE, < H’ <oK for all i€ I} = VoE; 
We leave proof of part 3) to the reader. § 


Even though the next result has a simple proof, the result is of major 
importance. 


Theorem 2.8.2 Let F < E be algebraic and let o:E—E be an embedding 
of E into itself over F. Then o is an automorphism of E. 


Proof. Let a € E and let p(x) = min(a,F). Let S be the set of roots of 
p(x) that lie in E. Then a €S. If 8 €S then of is also a root of p(x) in 
E, and so of €S. Hence, o |g is a permutation on S and so there is a 
8B€ES for which co8 =a. This shows that o is surjective, hence an 
automorphism of E. ff 


It is a cornerstone of the theory that an embedding o:F—L into an 
algebraically closed field can be extended to any algebraic extension of 
F. We begin with the case of a simple algebraic extension. 

Suppose that o:F-L is an embedding of F into an algebraically 
closed field L. If F < E and a € E is algebraic over F then we may take 
advantage of the fact that a satisfies its minimal polynomial p(x) over 
F to extend o to F(a) as follows. Since L is algebraically closed, 
p’(x) € F°[x] splits in L, and since o is an embedding, p%(x) is 
irreducible over oF. Hence p?(x) is the minimal polynomial over oF of 
any of its roots in L. Let @ be a root of p7(x) in L. Then 


F(a) = {f(a) | (x) € Fhe, deg f(x) < deg p(x)} 
and since deg p’(x) = deg p(x), 


F°(8) = {8(8) |e(x) € F7[x], deg g(x) < deg p(x)} 
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Thus we may define a map @:F(a)—-F (8) by 
a(f(a)) = £7(8) 


for any f(x) € F[x]. It is straightforward to show that o is an embedding 
of F(a) into F7(@) over o and that Ga = (. This proves the first part of 
the following theorem. The rest of the theorem follows easily. 


Theorem 2.8.3 Let F<E and let a€E be algebraic over F, with 
minimal polynomial p(x) = min(a,F). Let o:F-+L be an embedding of 
F into an algebraically closed field L. 


1) If @ is a root of p%(x) in L then o can be extended to an 
embedding 6:F(a)—L for which Ga = f. 

2) Any extension of o to F(a) must map a to a root of p%(x) in L. 

3) The number of extensions of o to F(a) is equal to the number of 
distinct roots of min(a,F) in F. & 


Zorn’s Lemma can now be used to extend the first part of this 
theorem to arbitrary algebraic extensions. 


Theorem 2.8.4 Let F < E be algebraic. Any embedding o:F-L into an 
algebraically closed field L can be extended to an embedding ¢:E—-L. 
Moreover, if a € E, p(x) = min(a,F) and £ €L is a root of p%(x), then 
we can arrange it so that Ga = @. (See Figure 2.8.1.) 


Proof. Let & be the set of all embeddings r:K->L over o for which 
Ta =f and F<K<E. Theorem 2.8.3 implies that & is not empty. 
Order the elements of & by saying that (7':K'-L) > (r:K-3L) if K < K’ 
and r’ is an extension of 7. Then & is a partially ordered set. If C= 
{7;:K,;-L} is a chain in &, the map 7:(JK;—L defined by the condition 
T|«, =7; is an upper bound for C in &. Zorn’s Lemma implies the 
existence of a maximal extension 7:K—L. We contend that K = E, for 
if not, there is an element y € E—K. But 7 is algebraic over K and so 
we may extend 7 to K(7), contradicting the maximality of r. I 


Lalg. cl. 


algebraic 


o 


Figure 2.8.1 
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We can now establish the essential uniqueness of algebraic closures. 


Corollary 2.8.5 Any two algebraic closures of a field F are isomorphic. 


Proof. Let K and L be algebraic closures of F. The identity map u:F—-F 
can be extended to an embedding 7:K-—-L. Since K is algebraically 
closed so is TK. But L is an algebraic extension of rK and so L=7K. 
Hence, 7 is an isomorphism. &f 


We close this section with a highly useful result on independence of 
embeddings. We choose a somewhat more general setting, however. A 
monoid is a nonempty set M with an associative binary operation and 
an identity element. If M and M’ are monoids, a homomorphism of M 
into M’ is a map ~:M—M' such that ~(a8) = ¥(a)¥(8) and (1) = 1. 


Definition Let M be a monoid and let K be a field. A homomorphism 
x:M—kK*, where K* is the multiplicative group of all nonzero elements 
of K is called a character of M in K. 0 


Note that an embedding o:E-L of fields defines a character 
o:E*L*. 


Theorem 2.8.6 (E. Artin) Any set J of distinct characters of M in K is 
linearly independent over K. 


Proof. Suppose to the contrary that 
O4Xy to + OnXy = 9 


for x;€ I and a; € K, not all 0. Look among all such nontrivial linear 
combinations of the y;,’s for one with the fewest number of nonzero 
coefficients and, by relabeling if necessary, assume that these coefficients 
are Q,...,@,. Thus, 


(2.8.1) 04X,(g) +-+++0,x,(g) = 0 


for all g € M and this is the “shortest” such nontrivial equation (hence 
a; #0 for all i). Note that since x;(g) € K*, we have y;(g) #0 for all 
g €M. Hence, r > 1. 

Since x; # x,, there is a g€M for which y,(g) # x,(g). For any 
h EM, we have 


1X, (gh) aparece a,X,(gh) =0 
that is, 
4X4 (8)x,(h) + ise de + a,X,(8)x,(h) = 0 
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Multiplying (2.8.1) by x,(g) gives 


0X 1(8)X4(h) +++ + x4 (g)x,(h) = 0 


Subtracting the previous two equations gives 


[x 1(8) — Xo(8)]xo(h) +--+ + @,[x1(g) — x,(8)]x,-(h) = 0 


and since the last coefficient is not zero, this contradicts the minimal 
nature of (2.8.1). Hence the characters are linearly independent. I 


Corollary 2.8.7 (Dedekind Independence Theorem) Let E and L be 
fields. Any set of distinct embeddings of E into L is linearly 
independent over L. 0 


2.9 Splitting Fields and Normal Extensions 
Let us repeat a definition from Chapter 1. 


Definition Let ¥ = {f,(x)|i€1} be a family of polynomials in F[x]. A 
splitting field for ¥ over F is an extension field E of F with the property 
that each f,(x) splits in E and that E is generated by the set of all roots 
of every polynomial in F. 0 


It is clear that, given a particular algebraic closure F of F, there is a 
unique splitting field for ¥ in F, since that splitting field must be the 
field generated by the roots in F of all polynomials in F. It is also true 
that any two splitting fields for ¥ are isomorphic by an isomorphism 
that fixes the elements of the base field F. 


Theorem 2.9.1 Let p(x) € F[x]. Any two splitting fields for p(x) over F 
are isomorphic over F. Specifically, if S, and S, are splitting fields for 
p(x) over F and if o:S,-S, is an F-embedding of S, into an algebraic 
closure of S, then o is an isomorphism of S, onto Sp. 


Proof. By Theorem 2.4.8, we may extend the inclusion map jF-S, to 
an embedding o:S,—S, over F. For any such embedding, let R; be the 
set of distinct roots of p(x) in S;. Then p%(x) = p(x) implies that 
oR, CR,. But o is injective and each set R, is finite, whence by 
symmetry, we have oR, = Ry. It follows that 


oS, = o[F(R,)] = F(oR,) = F(R,) = 82 


and so g is an isomorphism. & 
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This result also holds for arbitrary families of polynomials. 


Theorem 2.9.2 Let F be a family of polynomials over F. Any two 
splitting fields for F are isomorphic over F. Specifically, if S, and S, are 
splitting fields for ¥ over F and if o: S,-S, i is an F-embedding of S, 
into an algebraic closure of S, then o is an isomorphism of S, onto Sp. 


Proof. As in the proof of the previous theorem, we have an embedding 
0:8,—S,. Let E, <S, and E, <S, be splitting fields for a polynomial 
p(x) in ¥. Theorem 2.9.1 implies that the restriction of o to E, is an 
isomorphism, whence oE, = E,. Taking the composite over the splitting 
fields E, in S, of all polynomials in F gives 


Recall that if F < E is algebraic then E is an algebraic closure of F if 
and only if any nonconstant polynomial p(x) over F splits in E. Perhaps 
the next best thing would be that every irreducible polynomial p(x) 
over F that has one root in E splits in E. This property happens to 
characterize splitting fields. 


Theorem 2.9.3 Let F < E be algebraic and let F < E<F. The following 
are equivalent. 


1) Eis a splitting field for a family F of polynomials over F. 

2) Every embedding of E into F over F is an automorphism of E. 

3) Every irreducible polynomial over F that has one root in E splits 
in E. 


Proof. [1=>2] Let o be an embedding of E into F over F. Since E is a 
splitting field for a family ¥ of polynomials over F, we have E = F(R), 
where R is the set of roots of the members of ¥. Since o acts as a 
permutation on the roots of any member of ¥, we have oR = R and so 


oE = o(F(R)) = F(oR) = F(R) = 


[2=>3] Let f(x) be an irreducible polynomial over F, with a root @ in 
E. According to Theorem 2.8.4, if @ EF is a root of f(x), then the 
injection j:F-+F can be extended to an embedding o:E—F for which 
oa = #. By hypothesis, o is an automorphism of E, whence is also in 
E. Thus, f(x) splits in E. 

[3=>1] This follows immediately, since E is a splitting field for the 
family ¥ = {min(a,F)|a€ E}. § 
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Definition An algebraic extension F < E that satisfies any (and hence 
all) of the conditions in the previous theorem is said to be a normal 
extension. We also say that E is normal over F. 0 


Corollary 2.9.4 If F <E is a finite normal extension then E is the 
splitting field of a finite family of irreducible polynomials. 


Proof. Let E = F(a,,...,@,). Since E is normal over F, each minimal 
polynomial min(a;,F) splits in E. Clearly, E is generated by the roots of 
min(a;,F) and so E is the splitting field of the finite family 
{min(a;,F)}. ] 


Note that any extension F < E, with E algebraically closed, is normal 
since any nonconstant p(x) € F[x] splits in E. 
As it happens, the class of normal extensions is not distinguished. 


Example 2.9.1 It is not hard to see that any extension of degree 2 is 
normal. The extension Q < Q(4/2) is not normal since Q(4/2) contains 
exactly two of the four roots of x*—2, which is irreducible over Q. On 
the other hand, 


Q< QV2) < Q(4/2) 


with each step of degree 2 and therefore normal. As another example, 
since C is algebraically closed, Q <C is normal but Q < Q(‘/2) is not 
normal. 0 


The previous example notwithstanding, many of the properties that 
define distinguished classes do hold for normal extensions. 


Theorem 2.9.5 

1) IfF <E is normal and F < K < E then K < E is also normal. 

2) The class of normal extensions is closed under lifting: If F < E is 
normal and F < K is any extension then K < EK is normal. 

3) The class of normal extensions is closed under the taking of 
arbitrary composites and intersections: If {E;} is a family of fields, 
each normal over F, and each contained in a single larger field, 
then VE; is normal over F and ()E, is normal over F. 


Proof. Part 1) follows from the fact that a splitting field for a family of 
polynomials over F is also a splitting field for the same family of 
polynomials over K. For part 2), let E be a splitting field for a family ¥ 
of polynomials over F and let R be the set of roots in E of all 
polynomials in ¥. Then E= F(R). Hence, EK = K(R), which shows 
that EK is a splitting field for the family ¥, thought of as a family of 
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polynomials over K. Hence, K<EK is normal. For part 3), let 
o:V E;—F be an embedding over F. Then o is an embedding when 
restricted to each E; and so cE, = E;, whence 


and 


o(ME;) = NeE; = NE; ) 
Normal Closures 


Definition Let F < E be algebraic and let F be an algebraic closure of F 
containing E. The normal closure of F < E in F is the intersection of all 
fields L such that E < L < F and F <L normal. We denote this field by 
gre, o 


Note that since F <F is normal, the intersection described in the 
previous definition is a nontrivial one. 


Theorem 2.9.6 Let F < E < F be algebraic, with normal closure E”°. 


1) E®°¢ is the smallest subfield of F with the property that E < E" 
and F < E”* is normal. 

2) E®°= voE, over all o € Hom,(E,F). 

3) E®° is the splitting field in F of the family {min(a,F) | a € E}. 

4) E®¢ is the splitting field in F of the family {min(a,F)|a€ B} 
where B is a basis for E over F. 

5) If F <E is finite, then F < E”® is also finite. 


Proof. We prove only part 2), leaving the rest for the reader. Let 
E<L<F with F<L normal. Since E<L is algebraic, any o € 
Homp(E,F) may be extended to an embedding 7:L—F over F. Since 
F < L is normal, 7 is an automorphism of L. It follows that oF C L and 
so VoE <E**. On the other hand, if we let L= VoE, then F < L is 
normal since if 7 € Hom,(L,F) then ro runs over all elements of 
Hom,(L,F) as o does and so 


TL =1(VoE) = Vro(E)< VoE=L 
Since F <L is algebraic, we deduce that rL=L, that is, 7 is an 


automorphism of L over F. Hence, F < L is normal and so E™ <L= 
VoE. This shows that VoE = E”. § 
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Exercises 


1. 


2. 


3. 


10. 


11. 


12. 


13. 


Let R be an integral domain containing a field F. Show that if 
[R:F] < oo then R must be a field. 
If F < E is algebraic and R is a ring such that F CR CE, show 
that R is a field. Is this true if F < E is not algebraic? 
Let F < E and F < K be finite extensions and assume that EK is 
defined. Show that [EK:F] < [E:F][K:F], with equality if [E:F] and . 
{K:F] are relatively prime. 
Let o:K-E be a homomorphism of fields and let F < KN E. Show 
that o is F-linear if and only if o(a) =a for all ae F. 
Let F < E be a quadratic ertension, that is, an extension of degree 
2. Show that E has a basis over F of the form {1,a} where a” € F. 
Prove that any extension of degree 2 is normal. 
Let F be an infinite field and let F < E be an algebraic extension. 
Show that |E| = |F|. 
Let F <F where F is an algebraic closure of F and let G= 
Autg(F) be the group of all automorphism of F fixing F pointwise. 
Let 

FS = {a€ F| oa =a for all c EG} 


be the fized field of F under G. Dyidenuly, F < FS < F. Show that 
the minimal polynomial of any a€ FS over F has only one 
distinct root in F. Show also that the minimal polynomial of any 
a€F over F° has no multiple roots. Hint: for the latter 
statement, consider the polynomial p(x) = [](x—a,) where a, are 
the distinct roots of min(a,F™). 

Let p be a prime and let a#1 Pe a complex p-th root of unity. 
Show that min(a,Q)=1+x+x?+---+xP7!. What is the 
splitting field for xP — 1 over Q? 

Suppose that F <E is a finite extension and that E = F(S) for 
some set SC E. Must there exist a finite subset Sp CS for which 
E = F(S,)? 

Let F be a field of characteristic p #0 and let a€ F. Show that 
the following are equivalent: , (i) a EF (ii) F(a") = =F (iii) 
[F(a)]"* C F where [F(a)]" = {s |s € F(a)}. 

Let F <E be a finite normal extension and let p(x) € F[x] be 
irreducible. Suppose that the polynomials f(x) and g(x) are monic 
irreducible factors of p(x) over E. Show that there exists a o € 
Aut,(E) for which f?(x) = g(x). 

Let F <E be algebraic. Show that a normal closure for F < E 
exists and that any two normal closures are isomorphic over F. 
Show also that if F <E is finite, so is F< E™. If F<E is 
algebraic and o € Hom,(E,E) then Ima is contained in the normal 
closure of F < E that lies in E. 
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14. Let F be a field and let a,,...,a@,, be distinct elements of F. Prove 
that if a,ak fore a,ak = 0 for all integers k > 0 then a; = 0 for 
all i. 

15. Show that an extension F <E is algebraic if and only if any 
subalgebra S of E over F is actually a subfield of E. 

16. Let F <E be normal. Can any automorphism of F be extended to 
an automorphism of E? 

17. Suppose that F and E are fields and o:F—-E is an embedding. 
Construct an extension of F that is isomorphic to E. 


Constructions 


The goal of the following series of exercises is to prove that certain 
constructions are not possible using straight edge and compass alone. In 
particular, not all angles can be trisected, a circle cannot be “squared” 
and a cube cannot be “doubled.” The first step is to define the term 
constructible. We assume the existence of two distinct points P, and P, 
and take the distance between these points to be one unit. 


Definition A point, line or circle in the plane is said to be constructible 
if and only if it can be obtained by a finite number of applications of 
the following rules. 


1) P, and P, are constructible. 

2) The line through any two constructible points is constructible. 

3) The circle with center at one constructible point and passing 
through another constructible point is constructible. 

4) The points of intersection of any two constructible lines or circles 
are constructible. 0 


C1. Show that if a line L and point P are constructible, then the line 
through P perpendicular to L is also constructible. 

C2. Show that if a line L and point P are constructible, then the line 
through P parallel to L is also constructible. 

C3. Taking the constructible line through P, and P, as the x-axis and 
the point P, as the origin, the y-axis is also constructible. Show 
that any point (a,b) with integer coordinates is constructible. 

C4. Show that the perpendicular bisector of any line segment 
connecting two constructible points is constructible. 

C5. If P, Q and R are constructible points and L is a constructible line 
through R then a point S can be constructed on L such that the 
distance from S to R is the same as the distance from P to Q. 
(Thus, given distances can be marked off on constructible lines.) 


58 2 Field Extensions 


Definition A real number r is constructible if its absolute value is the 
distance between two constructible points. 0 


C6. Show that any integer is constructible. 

C7. Prove that a point (a,b) is constructible if and only if its 
coordinates a and b are constructible real numbers. 

C8. Prove that the set of numbers that are constructible forms a 
subfield of the real numbers containing Q. Hint: to show that the 
product of two constructible numbers is constructible or that the 
inverse of a nonzero constructible number is constructible, use 
similar triangles. 

C9. Prove that if a>0 is constructible, then so is Ja. Hint: first 
show that a circle of diameter 1+ a is constructible and that a 
line L through the center of the circle is constructible. Let P and 
Q be the intersection points of the circle with the line L. Mark off 
a units along the diameter PQ from P and denote that point by 
R. Is R. constructible? Construct a line M through R perpendicular 
to L. Let S be one point of intersection of M and the circle. What 
is the length of the line segment RS? 


The two previous exercises prove the following theorem. 


Theorem C1 If the elements of a field F <R are constructible, -and if 
a € F, then F(,/a) = {a+b,/a|a, b € F} is constructible. 0 


Theorem C2 Let F be a subfield of R and let E>F be a quadratic 
extension. Then E = F(,/a) for some a € F. 
Proof. Exercise. §f 


It follows from the two previous theorems that if F is constructible 
and if F <E is a quadratic extension then E is constructible. More 
generally, we have 


Theorem C3 If Q< E, <E,<:-::<E, is a tower of fields, each one a 
quadratic extension of the previous one then every element of E,, is 
constructible. 0 


We now turn to a converse of Theorem C3. 


Theorem C4 Let four constructible points, whose coordinates lie in a 
field F <R, be given. Let L and M be lines or circles constructed from 
these points. Then the points of intersection of L and M_ have 
coordinates in a quadratic extension of F. 


Proof. Exercise. § 


The import of the previous theorem is that each time we construct a 
constructible number a, the number lies in a quadratic extension of the 
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field of previously constructed numbers. Thus, we have 


Theorem C5 A real number is constructible if and only if it lies in a 
field E,, that is at the top of a tower of fields 
Q<E, <E,<-:-<E, 


each one a quadratic extension of the previous one. Hence, if a is 
constructible, then [Q(a):Q] must be a power of 2. 0 


Now consider what it means to say that an angle of 6 is 
constructible. Informally, we will take this to mean that we may 
construct a line L through the origin that makes an angle of 6° with the 
X-axis. 


C10. Show that such a line L is constructible if and only if the real 
number cos 6” is constructible. (This is an informal demonstration, 
since we have not formally defined angles.) 


The previous exercise prompts us to make the following definition. 


Definition An angle of @ is constructible if the real number cos @ is 
constructible. 0 


C11. Show that a 60° angle is constructible. 
C12. Show that a 20° angle is not constructible. Hint: verify the formula 


cos 36 = 4 cos? 6 —3 cos 0 
Let 7 = cos 20° and show that Tr is a root of 
p(x) = 8x*—6x-—1 


Show that p(x) is irreducible over Q and so [Q(r):Q] = 3. 

C13. Prove that every constructible real number is algebraic over Q. 
Assuming that 7 is transcendental over Q, show that any circle 
with a constructible radius cannot be “squared,” that is, a square 
cannot be constructed whose area is that of a unit circle. 

C14. Verify that it is impossible to “double” any cube whose side length 
r is constructible, that is, it is impossible to construct an edge of a 
cube whose volume is twice that of a cube with side length r. 


Chapter 3 
Algebraic Independence 


In this chapter, we discuss the structure of an arbitrary field 
extension F < E. Specifically, we will see that, for any extension F < E, 
there exists an intermediate field F <F(S)<E whose second step 
F(S)<E is algebraic and whose first step F<F(S) is purely 
transcendental. The latter means that there is no nontrivial polynomial 
dependency (over F) among the elements of S, and so these elements 
act as “independent variables” over F. Thus, F(S) is the field of all 
rational functions in these variables. 


3.1 Dependence Relations 


We begin with a general notion of dependence, intended to model 
linear independence. 


Definition Let X be a nonempty set and let A C X x 9(X) be a binary 
relation from X to the power set of X. We write x <S (read: x is 
dependent on S) for (x,S) € A and S < T when s < T for all s ES. Then 
A is a dependence relation if it satisfies the following properties, for all 


S, T and U € 9(X), 


1) (reflexivity) 
S<S 


2) (compactness) 


x <~S = x < Sp for some finite subset Sp of S 
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3) (transitivity) 
S<T,T<U>S+<U 


4)  (Steinitz exchange axiom) 
x <~S, x KS— {s} = s <(S— {s}) U {x} 
If x « S we say that x is independent of S. 0 


Definition A subset SC X is dependent if s <«S—{s} for some sES 
(equivalently, if S<S—{s}). A subset SCX is independent if 
s * S — {s} for all s € S. (Hence the empty set is independent.) 0 


The reader should have no trouble supplying a proof for the following 
lemma. 


Lemma 3.1.1 

1) IfS~<T then S <T"’ for any superset T’ of T. 

2) Any superset of a dependent set is dependent. 

3) Any subset of an independent set is independent. 

4) If S is a dependent set, then some finite subset S) of S is 
dependent. Equivalently, if every finite subset of T is independent, 
then T is independent. 0 


Theorem 3.1.2 If S is independent and xS then SU{x} is 
independent. 


Proof. Let seS. If s<(SU{x})—{s} then since skS—{s}, the 
exchange axiom would imply that x~<S, a contradiction. Hence 
s & (SU {x}) — {s}. Furthermore, by hypothesis x « S = (SU {x}) — {x}. 
Thus, S U {x} is independent. § 


Definition A set B C X is called a base if B is independent and X ~ B. 0 


Theorem 3.1.3 Let X be a nonempty set with a dependence relation ~. 


1) BCX is a base for X if and only if it is a maximal independent 
set in X. 

2) BCX < is a base for X if and only if B is minimal with respect to 
the property X ~ B. 

3) Let ACSCX, where A is an independent set (possibly empty) 
and X <S. Then there is a base B for X such that AC BCS. 


Proof. For part 1), assume B is a base. Then B is independent. If B is 
not maximal independent, there exists an x € X — B for which BU {x} 
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is independent. Hence, x «(BU {x})—{x}=B, a contradiction to 
X <B. For the converse, if B is a maximal independent set and x 4 B 
then BU {x} is independent, which is not the case. Hence, X < B and B 
is a base. 

For part 2), if B is a base, then X <~ B. Suppose that some proper 
subset By C B satisfies X < By. If b€ B—Bg then b < By <B — {b}, 
contracting the independence of B. Hence B is minimal. Conversely, 
suppose that B is minimal with respect to the property X ~ B. If B is 
dependent then X ~ B < B — {b} for some b € B, a contradiction to the 
minimality of B. Hence B is independent and a base for X. 

For part 3), we apply Zorn’s lemma. The set ¥ of all independent 
sets B in X satisfying A C B CS is nonempty, since A € f. Order £ by 
set inclusion. If C = {C;} is a chain in ¥, then the compactness property 
implies that the union UC, is an independent set, which also lies in Y. 
Hence, Zorn’s lemma implies the existence of a maximal element C € f, 
that is, C is independent, AC C CS and C is maximal with respect to 
these two properties. This maximality implies that S~C and so 
X ~S <C, which implies that C is a base. If 


To prove that any two bases for X have the same cardinality, we 
require a lemma. 


Lemma 3.1.4 Let S be a finite dependent set and let ACS be an 
independent subset of S. Then there exists a€S—A for which 
S ~S— {a}. 

Proof. Among all subsets of S— A, choose a maximal one B for which 


AUB is independent. Then B is a proper (perhaps empty) subset of 
S—A. If a€S—(AUB) then a x AUB <S — {a} and so S ~S — {a} 8 


Theorem 3.1.5 Any two bases for a set X have the same cardinality. 


Proof. Let B and C be bases for X. We first assume that at least one of 
B or C is finite; say B = {b,,...,b,,} is finite. Choose c, € C. The set 
C, = {c,,b,,...,b,,} satisfies the conditions of the previous lemma 
(with A ={c,}) and so, after renumbering the b,’s if necessary, we 
deduce that 


X <C, < {c,,b,,...,b,, 4} 
For any cy €C—{c,}, the set C, = {c),cy,b,,...,b,,_1} satisfies the 


conditions of the lemma (with A = {c,,c,}) and so, again after possible 
renumbering, we get 


X 4 Cy < {ey,Cp)b4)..-y bo} 
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Continuing this process, we must exhaust all of the elements of C before 
running out of elements of B, for if not, then a proper subset C’ of C 
would have the property that X~C’, in contradiction to the 
independence of C. Hence, |C| < |B]. Since this shows that C is 
finite, we may repeat the argument with the roles of B and C reversed 
to get |B] = |C]. 

Let us now assume that B and C are both infinite sets, and let C = 
{c;|a EI}. Thus, |C| = |I|. For each bE B, we have b ~C and so 
there is a finite subset I, CI such that b ~< {c,|i€I,}. This gives a 
map b—];,, from B to the set of finite subsets of the index set I. 
Moreover, 


I= I 
pep 
for if } €1— Ul]; then, for any b € B, we have 


and so c;<B~<C-—{c;}, which contradicts the independence of C. 
Hence, 


}C] = 111 = | Ute] <Xol Bl = 181 


Again reversing the roles of B and C shows that |B| = |C|.§ 


3.2 Algebraic Dependence 


We recall a definition. 


Definition Let F < E. An element t € E is transcendental over F if t is 
not algebraic over F, that is, if there is no nonzero polynomial p(x) € 


F[x] such that p(t) = 0.0 


Recall that if t is transcendental over F then F(t) is the field of all 
rational functions in the variable t, over the field F. 


Definition Let F <E and let SCE. An element a € E is algebraically 
dependent on S over F, written a <5, if a is algebraic over F(S). If a is 
not algebraically dependent on § over F, that is, if @ is transcendental 
over F(S) then a is said to be algebraically independent of S over F and 
we write a ‘S. 0 


The first order of business is to show that algebraic dependence is a 
dependence relation. 
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Theorem 3.2.1 Algebraic dependence is a dependence relation. 


Proof. Since any s €S is algebraic over F(S) we have S ~S. To show 
compactness, let a~<~S and let C be the set of coefficients of 
min(a,F(S)). Since C C F(S), each c € C is a rational function over F in 
a finite number of elements of S and so there is a finite subset Sy of S 
for which C C F(So). Hence a is algebraic over F(S), that is, a < So. 

For transitivity, suppose that a<S and S~<T and consider the 
tower 

F(T) < F(TUS) < F(TUS,a) 


Since every element of S is algebraic over F(T), and since a is algebraic 

over F(T US) we deduce that a is algebraic over F(T), whence a < T. 
Finally, we verify the exchange axiom. Suppose that a~<S and 

a kS—{s}. Let p(x) = min(a,F(S)). Since F(S) = F(S— {s})(s), the 


coefficients of p(x) are polynomials in s over F(S — {s}), that is, 
d . 
p(x) = > 7 f(s)x! 
i=0 
where we may assume that f(x) # 0. Hence, the polynomial 


d . 
p(x,y) = >_ fi(y)x! 


i=0 
in two independent variables is not the zero polynomial. This 
polynomial can also be written 


p(x,y) = 5 gi(x)y! 
i=0 


where g;(x) € F(S — {s})[x] and g,(x) #0. Then 


0 = p(a,s) = Yale 


Since g,(x) € F(S— {s})[x] is nonzero and a is transcendental over 
F(S —{s}), we infer that g(a) #0 and e>0. Hence, the equation 
above shows that s < F(S — {s} U {a}). I 


We may now take advantage of the results derived for dependence 
relations. 
Definition Let F < E. 


1) A subset S C E is algebraically dependent over F if s < S — {s} for 
some s € S, that is, if s is algebraic over F(S — {s}) for some s € S. 
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2) A subset SC E is algebraically independent over F if s & S — {s} 
for all s€S, that is, if s is transcendental over F(S — {s}) for all 
s € S. (Hence the empty set is algebraically independent over F.) 0 


Lemma 3.2.2 
1) Any superset of an algebraically dependent set is algebraically 
dependent. 


2) Any subset of an algebraically independent set is algebraically 
independent. 0 


Theorem 3.2.3 If S is algebraically independent over F and a@ is 
transcendental over F(S) then SU {a} is algebraically independent over 
F.0 


Let us provide another characterization of algebraically dependent 
sets. 


Theorem 3.2.4 Let F < E. A subset S of E is algebraically dependent 
over F if and only if there is some nonzero polynomial p(x,,...,x,) in 
n > 1 variables over F for which p(sj,...,8,,) = 0, for distinct s; € S. 


Proof. Suppose first that S is algebraically dependent over F. Then 
some s €S is algebraic over F(S — {s}) and so there exists a polynomial 
p(x) of degree d>0 over F(S—{s}) for which p(s)=0. Such a 
polynomial has the form 


p(x) = Sy Paleas Sm) P;(81,--- xi 


i=0 qj G5(8y)-++5 5m) a 


where p;(x,,...,X,,) and q;(x,,...,X,,) are polynomials in m variables 
and the s;€S—{s} are distinct. Note that pg(s),...,8,,) #0 and 
q;(s},---,8,,) #0 for all i. Letting x=s and clearing this of 
denominators gives 


d : 
0= >on(sp. »+)8p)8" 


i=0 


for polynomials = r,(x,,...,x,,), with —rg(s),...,8,,) #0. Thus 
Tq(X1,--+)%,) is not the zero polynomial and p(x) = )o1,(x,,...,%,)X' 
is a nonzero polynomial satisfied by the m+1 distinct elements 
S1)-+-)5_,8 in S. 

For the converse, suppose that p(s,,...,8,) =0 for distinct s, €S, 
where p(x,,...,X,) is a nonzero polynomial over F. We may assume 
without loss of generality that s,,...,8, do not enjoy a similar 
polynomial dependency and hence that 
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d 
P(X,,---5X,) = yy Pj(Xqy +++) Xp) X4 
i=0 


where pg(Xq,.--,X,) #0 and pg(sg,...,8,) #0. Hence the nonzero 
polynomial 


d . 
p(x) = Y> Pi (85-++18q)*! 


i=0 


satisfies p(s,) =0, showing that s, is algebraic over F(S—{s,}) and 
hence that S is algebraically dependent over F. &l 


Corollary 3.2.5 Let F < E and let S = {s,,...,8,} be a subset of E. Then 
S is algebraically independent over F if and only if s,, is transcendental 


over F(s,,...,8,,-,) for all m= 1,...,n. 


Proof. If S is algebraically independent then s,, is transcendental over 
F(S — {s,,}) and therefore also over the smaller field F(s,,...,8,,_,)- For 
the converse, if S is algebraically dependent then there is a nonzero 
polynomial dependency of the form 


d 
0= > Py(849-++1 mai) 8a 
i=0 


for some m <n where p,(s;,.-.,8,,_;) #0, whence s,, is algebraic over 
F(s),...,8,-,)- This contradiction implies that S is algebraically 
independent. § 


3.3 Transcendence Bases 


Definition Let F < E. A transcendence basis for E over F is a subset 
BCE that is algebraically independent over F and for which E ~B, 
that is, for which F(B) < E is algebraic. 0 


Since algebraic dependence is a dependence relation, we immediately 
get the following two results. 


Theorem 3.3.1 Let F < E. A subset B C E is a transcendence basis for E 
over F if and only if it satisfies either one of the following. 


1) Bis a maximal algebraically independent subset of E over F. 
2) B is minimal with respect to the property that F(B)<E is 
algebraic. 0 
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Theorem 3.3.2 Let F < E. 


1) Any two transcendence bases for E over F have the same 
cardinality. This cardinality is called the transcendence degree of 
E over F, and is denoted by [E:F],. 

2) Suppose F CA CS CE where A is algebraically independent over 
F and F(S) <E is algebraic. Then there exists a transcendence 
basis B for E over F satisfying ACBCS. In particular, 
[E:F], < |S|.0 


Definition An extension F <E is said to be purely transcendental if 
E = F(B) for some transcendence basis B for E over F. 0 


We remark that if E is purely transcendental over F then E = F(B) 
for some transcendence basis B, but not all transcendence bases for E 
over F need generate E. The reader is asked to supply an example in 
the exercises. 

The following few simple results concerning transcendental extensions 
will prepare the way to finishing the proof (promised in Chapter 2) that 
the class of finitely generated extensions is distinguished. 


Corollary 3.3.3 If E is finitely generated over F and B.- is a 
transcendence basis for E over F then B is a finite set and F(B) < Eisa 
finite extension. 


Proof. Theorem 3.3.2 implies that B is finite. The second part follows 
from the fact that E is finitely generated over F(B) as well, and a 
finitely generated algebraic extension is finite. § 


Theorem 3.3.4 Let F <K <E and suppose that F < K is algebraic. If 
T CE is algebraically independent over F, then T is also algebraically 
independent over K. In other words, T remains algebraically 
independent over any algebraic extension of the base field. 


Proof. If T is not algebraically independent over K, there exists t € T 
algebraic over K(T—{t}). Since F <K is algebraic, we deduce that 
F(T — {t}) < K(T — {t}) is algebraic, and so each step in the tower 


F(T — {t}) < K(T — {t}) < K(T — {t})(t) = K(T) 


is algebraic, whence t€K(T) is algebraic over F(T—{t}), in 
contradiction to the algebraic independence of T over F. & 


We are now in a position to finish the proof that the class of finitely 
generated extensions is distinguished. Note how much more involved 
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this task is than showing that finite or algebraic extensions are 
distinguished. 


Theorem 3.3.5 Let F < K < E. If E is finitely generated over F then K 
is also finitely generated over F. Thus, the set of finitely generated 
extensions is distinguished. 


Proof. Let S = {s,,...,8,} be a transcendence basis for K over F. Then 
the second step in the tower F < F(S) <<K<E is algebraic and E is 
finitely generated over F(S). Hence, if we can prove the theorem for 
algebraic intermediate fields, we will know that K is finitely generated 
over F(S) and therefore also over F, since S is a finite set. 

Thus, we may assume that F< K<E with F< K algebraic and 
show that [K:F] is finite. Let T = {t,,...,t,} be a transcendence basis 
for E over F. Our plan is to show that 


[K:F] < [E:F(T)] 


(see Figure 3.3.1) by showing that any finite subset of K that is linearly 
independent over F is also linearly independent over F(T) [as a subset 
of E]. Since [E:F(T)] is finite by Corollary 3.3.3, the proof will be 
complete. 


K F(T) 


algebraic 


s 
Figure 3.3.1 


First, we observe that, by Theorem 3.3.4, since T is algebraically 
independent over F, it is also algebraically independent over the 
algebraic extension K of F. 

Let Y = {y,,.--:Ym}©K be linearly independent over F. Suppose 
that 


do1,(t,,...,t,)y; = 0 


where 1;(t,,...,t,) € F(T). By clearing denominators if necessary, we 
may assume that each r,(t,,...,t,) is a polynomial over F. Collecting 
terms involving like powers of the t;’s gives 
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e e 
>» ( een) G)teoct oe =0 


Cyyeeese i 


where Berseeeyeqi E F is the coefficient of t;}---t[” in 1,(t,,...,t,). Since 


T is algebraically independent over K, the products ty). . $n are linearly 


independent over K, and hence also over F(y,,...,Y,,) € K. Thus 


Sere =0 


1 


and the linear independence of Y over F then implies that 


Cp ye rey Onil =0 
Hence 1;(t,,...,t,)=0 for all i. This shows that Y is linearly 
independent over F(T), as desired. § 


The next theorem gives some verisimilar facts about simple 
transcendental extensions; in particular, if E= F(t), where t is 
transcendental over F, then any nonconstant rational function in t is 
also transcendental over F and E is algebraic over any intermediate 
field other than the base field F. 


Theorem 3.3.6 

1) Suppose that E = F(t), where t is transcendental over F. If s = 
f(t)/g(t) € F(t) where f(t) and g(t) are relatively prime and at 
least one is nonconstant, then s is transcendental over F, t is 
algebraic over F(s) and [F(t):F(s)] = maz(deg f(t), deg g(t)). 

2) If t is transcendental over F then F(t) is algebraic over any field K 
satisfying F< K< F(t), K#F. 

3) If F<E is purely transcendental then any a€E-F is 
transcendental over F. 


Proof. For 1), if we show that t is algebraic over F(s), it will follow that 
s is transcendental over F, for otherwise F < F(s) < F(t) would be an 
algebraic tower. The polynomial 


p(x) = g(x)s — f(x) € F(s)[x] 


has the property that p(t) =g(t)s—f(t)=0. Moreover, p(x) is 
irreducible over F(s). For if we think of p(x) as a polynomial in the two 
(independent) variables x and s, it is clear that if p(x) has a nontrivial 
factorization, one of the factors must be a nontrivial common factor of 
f(x) and g(x), which is impossible. Since 
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deg p(x) = maz(deg f(x), deg g(x)) 


part 1) is proved. Part 2) follows easily from part 1). 

For part 3), if a€E—F then a € F(t,,...,t,,) for some finite set 
{t,,...,t,} of algebraically independent elements. By part 1), every 
element of F(t,) not in F is transcendental over F. Similarly, every 
element of F(t,,t,) not in F(t,) is transcendental over F(t,) and hence 
also over F. Continuing this argument gives the desired result. § 


We leave it as an exercise to show that the converse of part 3) is 
false, that is, there exist extensions F<E that are not purely 
transcendental but for which every a € E —F is transcendental over F. 

The following is an example of an extension that is neither algebraic 
nor purely transcendental. 


Example 3.3.1 Let n >3 and let F be a field with char(F)/n. Let u be 
transcendental over F, let v be a root of p(x) =x™+u"—1 in some 
splitting field and let E = F(u,v). Clearly, E is not algebraic over F. We 
contend that E is also not purely transcendental over F. Since v is 
algebraic over F(u), we deduce that {u} is a transcendence basis for E 
over F and so [E:F], = 1. If E were purely transcendental over F there 
would exist a transcendental element t over F for which F(t) = F(u,v). 
Let us show that this is not possible. 
If F(t) = F(u,v) then 


where a(t), b(t), c(t) and d(t) are polynomials over F. Hence 


an(t) | cM(t) 
Bt) + at) ~ 


[a(t)d(t)] + [b(t)e(t)]}" = [b(t)d(t)]" 


This can be written 


or 


f(t) + gr(t) = hr(t) 


for nonconstant polynomials f(t), g(t) and h(t), which we may assume 
to be pairwise relatively prime. Let us assume that deg f(t) < deg g(t), 
in which case deg h(t) < deg g(t). We now divide by h(t) and take the 
derivative with respect to t to get (after some simplification) 


fh = fh’] + ge" le'h _ gh’] =0 
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Since f and g are relatively prime, we deduce that g"™~! | f_h — fh’. But 
this implies 


(n—1)deg g < deg fh—1 = deg f+ degh—1 < 2degg—1 


which is not possible for n>3. Hence, F <F(u,v) is not purely 
transcendental. 0 


While the vector space dimension is multiplicative over a tower of 
fields, the transcendence degree is additive, as we see in the next 
theorem. 


Theorem 3.3.7 Let F < K < E. 


1) If SCK is algebraically independent over F and TCE is 
algebraically independent over K then SUT is algebraically 
independent over F. 

2) If S is a transcendence basis for K over F and T is a transcendence 
basis for E over K then SUT is a transcendence basis for E over 


F. 
3) [E:F], = [E:K], + [K:F], 


Proof. For part 1), suppose for the purposes of contradiction that SUT 
is algebraically dependent over F. Then there exists an a € SUT that is 
algebraic over F(SgUTQ) for some finite sets Sg CS and Ty CT not 
containing a, and we may assume that no proper subset T, of Tp has 
the property that a is algebraic over F(S) UT}). If a € T then since a is 
algebraic over F(SpUTp), it is also algebraic over the larger field 
K(T — {a}), in contradiction to the algebraic independence of T over K. 
Hence a ¢ T and so a€S. But then Ty cannot be empty, since S is 
algebraically independent over F. If t € Tg then the minimality of Ty 
implies that a is not algebraic over SgUT )—{t}, that is, 
a KS gUT,—{t}. But a<SjUT, and so the exchange axiom gives 
t<S gUT )U{a}—{t}. In other words, t is algebraic over 
F(Sp UT) U {a} — {t}), and hence also over the larger field K(T — {t}), 
again contradicting the algebraic independence of T over K. This proves 
part 1). 

For part 2), we know by part 1) that SUT is algebraically 
independent over F. Also, since F(S) < K and K(T) < E are algebraic, 
each step in the tower F(SUT)<K(T)<E is algebraic and so 
F(SUT) <E is algebraic. Hence, SUT is a transcendence basis for F 
over E. Part 3) follows directly from part 2). § 
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*3.4 Simple Transcendental Extensions 


The class of purely transcendental extensions is much less well 
behaved than the class of algebraic extensions. For example, let t be 
transcendental over F. Then in the tower F < F(t”) <F(t), the 
extension F < F(t) is purely transcendental (and simple) but the second 
step F(t) < F(t) is not transcendental at all. 

In addition, if F < E is purely transcendental and F < K < E, it does 
not necessarily follow that the first step F < K is purely transcendental. 
However, this is true for simple transcendental extensions. The proof of 
this simple statement illustrates some of the apparent complexities in 
dealing with transcendental extensions. 


Theorem 3.4.1 (Luroth’s Theorem) Let t be transcendental over F. If 
F <K < F(t) and K #F then K = F(s) for some s € F(t). 


Proof. The idea behind the proof is straightforward. Since K # F, we 
know by Theorem 3.3.6 that K < F(t) is algebraic. Indeed, for any s € 
K —F, the tower F(s) < K < F(t) is algebraic. We want to find an s € 
K—F for which [F(t):F(s)] = [F(t):K], showing that K = F(s). Recall 
from Theorem 3.3.6 that if s = f(t)/g(t)@€K—F where f and g are 
relatively prime polynomials over F, then 


= [F(t):F(s)] = maz(deg f(x), deg g(x)) 
Let 


a,(t) nol 
b,(t)* 


p(x) = min(t,K) = x" + ——~ aieeeat 


where a,(t), b(t) € F(t). Then [F(t):K] =n and we wish to show that 
d, =n for some s € K —F. Evidently d, >n for alls € K—F. 


Note that since t is not algebraic over F, not all of the coefficients of 
p(x) can lie in F. Therefore, we may let 


_ a(t) 

= BO) €K-F 
for some k and assume that a,(t) and b,(t) are relatively prime. 
Consider the polynomial 


h(x) = ay(x) — BAS) 


Since s¢ F, we have h(x) #0. But h(t) =0 and so p(x) | h(x) over K. 
In other words, there exists q(x) € K[x] such that 


ay,(x) — Er baCe) = q(x)p(x) 
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or 


ay.(x) bj (t) — aj(t) by (x) = by (t)a(x)p(x) 
Multiplying both sides of this by 


r(t) = b,(t)---b,(t) 
gives 


(3.4.1) F(t) [aye(x)by (t) ~ ay (t)by(x)] = by (t)a(x)e(t) pC) 
where 


r(t)p(x) = b,(t)---b,(t)x™ + 3 [b, (t)---b;_(t)a,(t)b; y(t): -b,(t)}x"— 


i=1 


Now, we wish to factor out the greatest common divisor g(t) of the 
coefficients of x! (for j =0,...,n) from the right side of this expression. 
Note that g(t) divides the gcd of any two of these coefficients, in 
particular, g(t) divides the gcd of 


by(t)---by(t) and by (t)-+-by_y(t)ay(t)by 4 1(t): +b, (t) 


which is b,(t)---b,_1(t)by4)(t)---b,(t), since a,(t) and b,(t) are 
relatively prime. Hence, once g(t) is factored out of r(t)p(x): 


r(t)p(x) = g(t)p’(t,x) 
where p‘(t,x) € F[t,x] is primitive, in the sense that it is not divisible by 
any nonconstant polynomial in t, we still have as factors among the 


coefficients of p‘(t,x) the polynomials b,(t) and a,(t). Thus, the degree 
of p’(t,x) with respect to t satisfies 


(3.4.2) t—deg(p'(t,x)) > maz(deg a,(t),deg b,(t)) =d, 
Thus, (3.4.1) becomes 
(3.4.3) r(t) [aj (x) by (t) — ay (t)by.(x)] = by (t)a(x)a(t)p'(t,x) 


Next we multiply both sides of (3.4.3) by a polynomial u(t) that will 
clear all of the denominators of q(x), giving 


u(t)r(t) [ay.(x)by(t) — ay (t)by (0) = by (t)q'(t,x)p'(t,x) 


where p’(t,x), q/(t,x) € F[t,x]. Since p’(t,x) is not divisible by any 
nonconstant polynomial in t, we must have u(t)r(t) | b,(t)q/(t,x). Hence, 
there exists a polynomial q’’(t,x) € F[t,x] for which 
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(3.4.4) ay(x)by(t) — ay (t)by(x) = a"(t,x)p"(t,x) 
Now, the t—degree of the left hand side of this equation is at most 
maz(deg a,(t),deg b,(t)) =d, 


and by (3.4.2) the t-degree of the right hand side is at least d,. Hence, 
the t-degree of either side of (3.4.4) is d, and (3.4.2) implies that 
t—deg(q''(t,x)) = 0, that is 


(3.4.5) ay.(x)b,(t) — a, (t)by (x) = q'"(x)p'(t,x) 


where q’(x) € F[x]. Since the right side of (3.4.5) is not divisible by any 
nonconstant polynomial in t, neither is the left side. But the left side is 
symmetric in x and t, so it cannot be divisible by any nonconstant 
polynomial in x either. Hence, q’(x)p’(t,x) is not divisible by any 
nonconstant polynomial in x, implying that q(x) € F, that is, 


(3.4.6) ay,(x) bj (t) — ay(t) by (x) = a"p'( t,x) 


where q”’ € F. Finally, since the x-degree and t—degree of the left side 
of (3.4.6) agree, this is also true of the right side. Hence by (3.4.2), 


n= a-deg(p'(t,x)) = t-deg(p'(t,x)) > d, > n 
Thus, d, =n, and the proof is complete. 


It can be shown that Luroth’s theorem does not extend beyond 
simple transcendental extensions, but a further discussion of this topic 
would go beyond the intended scope of this book. 

We conclude with a determination of all F-automorphisms of a 
simple transcendental extension F(t). Let GL,(F) denote the group of 
all nonsingular nxn matrices over F. The proof provides a nice 
application of Theorem 3.3.6. 


Theorem 3.4.2 Let F < F(t) be a simple transcendental extension and 
let Autp(F(t)) denote the group of all automorphism of F(t) over F 


1) For each A= b Ar GL,(F) there is a unique o, € Autp(F(t)) 
for which 


at +b 
ct +d 


oyit— 
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Moreover, all automorphisms of F(t) over F have the form oy for 
some A € GL,(F). 
2) IfA,Be GL,(F), then 


C,sR=%,%R and C= o7: 


and oy, = og if and only if AB~! is a nonzero scalar matrix. In 
other words, the map 7:GL,(F)—Autp(F(t)) defined by rA = oy 
is an epimorphism with kernel equal to the group of all nonzero 
scalar matrices in GL,(F). 


Proof. Clearly, the map oy can be extended to a homomorphism of F(t) 
over F by setting 


aa( 2) = Head 
ALB) = g(oq(t)) 


Since maz(deg(at+b), deg(ct+d))=1, Theorem 3.3.6 implies that 
[F(t):F(o,t)] =1 and so o,(F(t)) = F(o,t) = F(t), showing that o, is 
surjective. Since oy is injective as well (fields have no nontrivial ideals), 
it is an automorphism of F(t) over F. 

We leave it to the reader to show that o,op = yp and that og =e 
if and only if C is a scalar multiple of the identity matrix. It follows 
that 

oo 


—j=0,=4 and o _jo, =0,=1 


A A 


and so 


Also, 7, = Op if and only ifo,__, = +, that is, if and only if AB"! isa 
scalar multiple of the identity. 

If o € Autp(F(t)) then F(t) = o(F(t)) = F(ct) and so [F(t):F(ot)] = 
1, which by Theorem 3.3.6 implies that ot = o,t for some 2x2 matrix 
over F. Hence, o =c,. Since o' also has the form Op for some matrix 
B, we have 


LC ATR OAR 


which implies that AB = al, for some a € F, whence A is nonsingular. § 


Exercises 

1. Find an example of a purely transcendental extension F < E with 
two transcendence bases B and C such that E = F(B) but F(C) is 
a proper subfield of E. 

2. Let F< E and F < K. Show that [EK:K], < [E:F],. 
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11. 


12. 


Let F < E< K and let T € K —E. Show that [E(T):F(T)], < [E:F], 
with equality if T is algebraically independent over F or algebraic 
over F. 

Use the results of the previous exercise to show that if F< K <E 
and F <L <E then [KL:F], < [K:F], + [L:F],. 

Let F be a field of characteristic # 2 and let u be transcendental 
over F, Suppose that u?+v?=1. Show that F(u,v) is a purely 
transcendental extension by showing that F(u,v) = F(w) where 
w =(1+v)/u. 

Let F<K<E and _ suppose that SCE is algebraically 
independent over K. Prove that F(S) < K(S) is algebraic if and 
only if F < K is algebraic. 

Show that the converse of part 3) of Theorem 3.3.5 is false by 
describing an extension E of F that is not purely transcendental, 
but for which every a € E —F is transcendental over F. 

Prove that the transcendence degree of R over Q is |R|. 

Show that [C:Q], = |C]. 

(An extension of Luroth’s Theorem) Suppose that F < E is purely 
transcendental. Show that any simple extension of F contained in 
E is transcendental over F. 

With regard to Theorem 3.4.2, show that o,op = o,p and og =4 
if and only if C is a scalar multiple of the identity matrix I. 

Prove Lemma 3.1.1. 


Chapter 4 
Separability 


4.1 Separable Polynomials 
Let us recall a few facts about separable polynomials from Chapter 1. 


Definition An irreducible polynomial p(x) € F[x] is separable if it has no 
multiple roots in any extension of F. An irreducible polynomial that is 
not separable is inseparable. {1 


Theorem 4.1.1 
1) An irreducible polynomial p(x) is separable if and only if 
p'(x) #0. 
2) If F is a field of characteristic 0, or a finite field, then all 
irreducible polynomials over F are separable. 
3) Let char(F) = p #0 and let p(x) be irreducible. 
a) If p(x) is inseparable, then there exists a positive integer d 
such that p(x) = q(x” , where q(x) is separable. In this case, 
all roots of p(x) have multiplicity pt. 


b) If p(x) = h(x’) where h(x) is any nonconstant polynomial and 
d is a positive integer, then p(x) is inseparable. 
4) Inseparable polynomials exist. 0 


The exponent d in part 3a) of the previous theorem is quite 
important and deserves a special name. Note that it can be 
characterized as the largest integer for which p(x) = q(x? ). 
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Definition Let p(x) € F[x] be an irreducible , polynomial. If char(F) = 
p#0, the integer d for which p(x) = q(x” ), with q(x) separable, is 
called the radical exponent of p(x). If char(F) = 0, the radical exponent 
of p(x) is defined to be 0. If a is algebraic over F, the radical exponent 
of a over F is the radical exponent of min(a,F). 0 


The following definition allows us to handle the cases char(F) = 
and char(F) = p # 0 simultaneously. 


Definition The exponent characteristic expchar(F) of a field F is defined 
to be 1 if char(F) = 0 and char(F) otherwise. 0 


Thus, any irreducible polynomial p(x) has the form p(x) = q(x") 
where q(x) is separable, p is the exponent characteristic of F and d is 
the radical exponent of p(x). Moreover, p(x) is separable if and only if 
its radical exponent is 0. 


Definition Let F < E. Then a € E is separable over F if a is algebraic 
over F and its minimal polynomial min(a,F) is separable. The 
extension F < E is separable (or E is separable over F') if every element 
of E is separable over F. 0 


Before proceeding, we record a useful lemma. If F is a field and 
S CF then S” denotes the set {s"|s € S}. 
Lemma 4.1.2 Let F < E be algebraic with expchar(F) = p and let S C E. 
1) F(S") = F([F(S)]") for any k > 0. 


2) F(S)= F(S") holds for some k > 1 if and only if it holds for all 
k>1. 


3) F= F™ holds for some k > 1 if and only if it holds for all k > 1. 
Proof. Part 1) follows from the fact that [F(S)]| — FPS) and so 


F({F(S)|") = F(F?(S")) = F(S") 


To prove part 2), suppose that F(S) = F(S") for some k > 1. Using part 
1), we have 
F(S) = F(S") = F((F(S))") < F(F(S)P) = FS”) 


from which we conclude that F(S) = F(S") for all r<k. In particular 
F(S) = F(SP) and so again using part 1), we obtain 
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k+1 
F(S) = F(S?) = F(F(S)]") = F(IE(S)) = FS?" ) 
and so F(S) = F(S") for all r > k as well. For part 3), we observe that 
FY < RP CF 


and so F = F® holds for some k > 1 if and only if F = FP, which holds 
if and only if F = FP for all k > 1.8 


4.2 Separable Degree 


If F <E is algebraic and if o:F-—+L is an embedding of F into an 
algebraically closed field L, we let §&,(E,F) denote the set of all 
extensions of o to an embedding of E into L. Remarkably, the 
cardinality of &,(E,F) does not depend on o or L. 


Theorem 4.2.1 If F < E is algebraic and o:F-+L is an embedding of F 
into an algebraically closed field L then the cardinality of &,(E,F) 
depends only on the extension F < E and not on o or L. In other words, 
if 7:FL’ is an embedding with L’ algebraically closed, then 
| &,(E,F) | = | &,(E,F) | . 


Proof. Observe first that if @ is an extension of o to E then @E is 
algebraic over oF and therefore contained in the algebraic closure of oF 
in L. Hence we may as well assume that L is an algebraic closure of oF. 
Similarly, we may assume that L’ is an algebraic closure of TF. 

Referring to Figure 4.2.1, the map ro~!:o(F)>7r(F) is an 
isomorphism that can be extended, by Theorem 2.8.4, to an embedding 
A:L—L’. Since oF <L is algebraic, so is rF < AL, and since AL is 
algebraically closed, we have \L=L’, implying that A:L-L’ is an 
isomorphism. 

Ifo € &,(E,F) then the map AG:E-L’ is an embedding of E into L’ 
extending 7 on F. This defines a map from &,(E,F) to &(E,F) given by 
GAG. It is clear that this map has an inverse given by 7++A~ ‘7 and so 
both maps are bijections. 


U A=at ao" L 


Le a 
<——*— § ——> 


Pete 


?@s— FS ef) 


Figure 4.2.1 
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In view of Theorem 4.2.1, we may make the following definition. 


Definition Let F < E be an algebraic extension and let o:F—L be an 
embedding of F into an algebraically closed field L. The cardinality of 
the set &,(E,F) is called the separable degree of E over F and is denoted 
by [E:F],. 0 


It will be convenient for our present discussion to adopt the following 
definitions, at least temporarily. 


Definition An algebraic extension F<E is degreewise separable if 
(E:F], = [E:F]. An algebraic extension F < E is separably generated if 
E = F(S) where each a € S is separable over F. 0 


We next prove that the separable degree is multiplicative. 


Theorem 4.2.2 If F < K < E then [E:F], = [E:K],[K:F].,. 


Proof. The set &,(K, F) of extensions of the inclusion map j:F>E to an 
embedding j:K-—E has cardinality [K: F],- Each such extension j€ 
&,(K,F) can be further extended to an embedding j:EF. Clearly, the 
resulting extensions are all distinct and so 


|§(E,F)| > |8(K,F)| |€(E,K) | 


On the other hand, if o € & (E, F) and o9:K—E is the restriction of o to 
K then 2 is the extension of j:F—E to K, hence an element of & (K,F). 
Since o is the extension of a9 to E, o is obtained by a double extension 
of 7:F3E and so equality holds in the inequality above. § 


4.3 The Simple Case 


Now let us consider simple extensions in the present context. Let 
F < F(a) be algebraic. If p(x) = min(a,F) and if j:F-4F is the inclusion 
map then Theorem 2.8.3 implies that [F(@):F], = |§,(F(a),F) | is equal 
to the number of distinct roots of p(x). If P(x) is separable, it has 
deg p(x) = [F(a):F] distinct roots and so 


[F(a):F], = [F(@):F] 


If p(x) = a(x") has radical exponent d > 1, then each root of p(x) has 
multiplicity p° and so 


p4[F(a):F], = deg p(x) = [F(a):F] 
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We thus have the following theorem. 


Theorem 4.3.1 Let F< F(a) be algebraic with expchar(F)=p. If 
min(a,F) has radical exponent d, then 


(4.3.1) p4[F(a):F], = [F(a):F] 


In particular, [F(q):F],|[F(@):F]. Moreover, the following are 
equivalent. 


1) aq is separable over F. 
2) F< F(q) is degreewise separable; that is, [F(a):F], = [F(a):F]. 
3) F< F(a) is separable. 


Proof. We have seen that (4.3.1) holds and since a is separable if and 
only if its radical exponent is 0, it follows that 1) and 2) are equivalent. 
Clearly 3) implies 1). To see that 2) implies 3), let @ € F(a) and 
consider the tower F < F(f) < F(a). Then 


[F(a):F(6)],[F(4):F], = [F(@):F], = [F(@):F] = [F(e):F(A)[F(4):F] 


Since F(a) = F(@)(a), the extension F() < F(a) is simple and so each 
factor on the far left divides the corresponding factor on the far right, 
implying that the corresponding factors are equal. In particular, 
[F(G):F], = [F(@):F], showing (by the equivalence of parts 1 and 2) that 
8 is separable over F. Hence F < F(a) is separable. & 


Note that, according to the previous theorem, if a is separable so is 
any polynomial in a. The following is another characterization of 
separable elements. 


Theorem 4.3.2 Let a be algebraic over F, with expchar(F) = p. Then a 
is separable over F if and only if 


F(a) = F(a") 


for some k > 1, and hence for all k > 1. 


Proof. Lemma 4.1.2 allows us to confine our attention to k = 1. Suppose 
a is separable over F. First suppose that a is separable over F. The 
polynomial (x — a)? = xP — aP € F(a?P)[x] has a as a root and so there 
exists an r < p such that 


min(a,F(a?P)) = (x -—a@)* 
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Since p(x) = min(a,F) also has coefficients in F(a?), we have 
(x — a)" | p(x). But p(x) is separable, and so r=1. Thus r(x) =x-—a, 
implying that a € F(a?) and consequently F(a) = F(aP). 

Conversely, suppose that F(a) = F(a?) and let p(x) = min(a,F). If a 
is not separable over F then p(x) =q(x?). Since q(aP) = p(a) = 0, we 
get 

[F(a?):F] < deg q(x) = d[F(a):F] 


which is contrary to F(a) = F(a). Thus a is separable over F. & 


4.4 The Finite Case 


Now we consider an arbitrary finite extension F <E. By Theorem 
2.5.2, we may let E = F(ay,...,a,,) where a; is algebraic over F. Taking 
separable degrees in the tower 


(4.4.1) F < F(a) < F(a1,Q) rr G F(a). . +) @,) 
gives 


[F(a,,...,@,):F], = [LF ++) 0%):F(04,..+50%_4)] 


Since each step on the right is simple, Theorem 4.3.1 implies that each 
separable degree on the right divides the corresponding vector space 
degree, and so 


[F(ay,..-,@,):F], | [F(a,,---0,):F] 


Theorem 4.4.1 Let F<E be finite. Then [E:F],|[E:F]. Also, the 
following are equivalent. 


1) Eis separably generated. 
2) EF <E is degreewise separable; that is, [E:F], = [E:F]. 
3) EF <E is separable. 


Proof. [1=>2] Suppose that E=F(S) where the elements of S are 
separable over F. The finiteness of F < E implies that E = F(a,...a,), 
for some n > 0, where a; € S. Since a; is separable over F(a,,...,;_1), 
each step in the tower (4.4.1) is generated by a single separable element. 
Hence, each step is degreewise separable and the multiplicativity of 
degrees implies that F < E is degreewise separable. [2=>3] Let @ € E and 
consider the tower F < F(8) <E. Since F < E is degreewise separable, 
so is F < F(f) and so @ is separable over F. Hence, F < E is separable. 
[3=>1] This is clear from the definition. 
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Thus, for finite extensions, the notions of separability, degreewise 
separability and separably generated are equivalent. Note that if 
F<K<E is finite then F <E is separable if and only if K < E and 
F <K are separable. We will show later that the class of separable 
extensions (finite or otherwise) is distinguished. Let us have another 
characterization of finite separable extensions. 


Theorem 4.4.2 If F < E is separable then E = F(EP) for all k>1. By 
way of converse, if F < E is finite and E = F(E?) for some k > 1, then 
F < E is separable. 


Proof. Suppose F < E is separable. Lemma 4.1.2 allows to to confine 
our attention to k = 1. For any a € E, we have F(a) = F(a?) C F(EP) 
and so E C F(EP). The reverse inclusion is obvious and so E = F(EP). 

Now suppose that E = F(EP). Since F< E is finite, we have E = 
F(SP) for some finite subset SCE. Since E = F(S?) < F(S) <E, we 
have E = F(S) = F(SP) and so Lemma 4.1.2 implies that E = F(S?) for 
all k > 1. If d is the maximum of the radical exponents of the elements 
of S then every element of S? is separable over F and so E = F(S®) is 
separably generated over F and therefore separable over F. # 


Corollary 4.4.3 Let F < E be a separable extension and let S C E. 


1) IfS spans E over F, then sr spans E over F, for any k > 1. 

2) If F <E is finite and S is linearly independent over F, then SP is 
linearly independent over F, for any k > 1. 

3) If F <E is finite and S is a basis for E over F, then S* is a basis 
for E over F, for any k > 1. 


Proof. If S spans E over F, then S? spans EP over FP, and hence also 
over F. Hence SP spans F(EP)=E over F. Repeating this argument 
proves part 1). For part 2), since F < F(S) is separable and S spans 
F(S) over F, we conclude from part 1) that SP spans F(S) over F. Since 


[S| = |S| <o 


it follows that S* is a basis for F(S) over F and is therefore linearly 
independent over F. Part 3) follows from parts 1) and 2). i 


We now prove that all finite separable extensions are actually simple 
extensions. 


Theorem 4.4.4 If F < E is a finite separable extension then there exists 
a y € E such that E = F(7). If F is an infinite field, there exist infinitely 
many such primitive elements ¥. 


Proof. If F is a finite field, then so is E, and we appeal to the fact that 
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the multiplicative group E* of nonzero elements of E is cyclic. If E* = 
(7) then E = F(7) and E is simple over F. Let us now assume that F is 
an infinite field. 

Since F < E is finitely generated, it is sufficient to consider the case 
E = F(a,@), and then appeal to an inductive argument. Let O50 29 Op 
be the distinct embeddings of E into F over F. Consider the polynomial. 


p(®) = T]t0;—ei)(a) + (0, ~a)(0)x] € Fs 
1FJ 


Since none of the linear factors on the right is 0, we conclude that 
p(x) #0. Since F is infinite, there must infinitely many elements s € F 
such that p(s) #0. Hence (o;—0;)(a) +s(o;—0;)(8) #0 for all i Fj, 
that is, the n elements 7; = 0,0 +50, B=a;(a +58) are distinct. But 
each 7; is a root of min(a+sf, F) and so 


[E:F] > [F(a +s@):F] > n = [E:F], = [E:F] 
from which it follows that E = F(a +s). 


Corollary 4.4.5 If F has characteristic 0 or if F is a finite field then any 
finite extension of F is simple. 0 


We can improve upon Theorem 4.4.4 without too much additional 
work. This result will prove useful to us later. 


Theorem 4.4.6 If E = F(a,,...,a@,,8) where a; is separable over F and 2 
is algebraic over F then F < E is a simple extension. 


Proof. If F is finite, then E is finite, and therefore F < E is simple. Let 
us assume that F is infinite. Theorem 4.4.4 implies that E = F(a,() for 
some a@ separable over F. We may proceed as in the proof of that 
theorem to obtain an element @ +sa for which the elements o;( + sa) 
are distinct, where o,,...,0,, are the distinct embeddings of E into F 
over F and o, =«. Note that the set {o,a,...,0,a@} contains a complete 
set of roots of p(x) = min(a,F) and {o,8,...,¢,,3} contains a complete 
set of roots of pg(x) = min(A,F). 
Let q(x) = pAlB +sa —sx). Since o,@ = a, we have 


q(o,a) = pg(Z) =0 


and since 0,7 +s0,a —so,a # 0,0 for i# 1, we have 
q(o;a) = pg(o8 + so, —soia) #0 


for i#1. Hence, the polynomials p,(x) and q(x), both of which have 


4 Separability 87 


coefficients in F(@+ oa), have precisely one root in common, namely 
o,a =a. Thus, since p,(x) has no multiple roots, the greatest common 
divisor of p,(x) and q(x) is x—a@, which must have its coefficients in 
F(6+sa) as well. In other words a € F(f+sqa), from which it follows 
that @ € F(G +sa), whence F(a,8) = F(G+sa). 8 


4.5 The Algebraic Case 


For arbitrary algebraic extensions F < E, we have the following. 


Theorem 4.5.1 Let expchar(F) =p. An algebraic extension F <E is 
separable if and only if it is separably generated. If F < E is separable 
then E = F(E") for all k > 1. 


Proof. If F < E is separable then E is separably generated (by itself) 
over F. For the converse, assume that E = F(S) where each a€S is 
separable over F and let G € E. Then ( € F(S,) for some finite subset 
So CS. Since F < F(S,) is finitely generated and algebraic, it is finite. 
Thus, Theorem 4.4.1 implies that F < F(S9) is separable. Hence ( is 
separable over F and so F <E is separable. The last statement was 
proved in Theorem 4.4.2. § 


We. may now establish that the class of separable extensions is 
distinguished. 


Theorem 4.5.2 The class of separable extensions is distinguished. It is 
also closed under the taking of arbitrary composites. If F <E is 
separable and E™ is the normal closure of E over F then F < E™ is 
separable. 


Proof. Let F<K<E. If all extensions are finite, we have already 
shown (by a degree argument) that F <E is separable if and only if 
F < K and K < E are separable. In general, we leave it as an exercise to 
show that if F <E is separable then F < K and K < E are separable. 
Suppose that F < K and K < E are separable and let a€ E. Let CCK 
be the set of coefficients of p(x) = min(a,K). Then p(x) = min(a,F(C)) 
and so a is separable over F(C). It follows that each step in the tower 
F < F(C) < F(C,q) is finite and separable, implying that a is separable 
over F. This shows that F < E is separable and completes verification of 
property D1) in the definition of distinguished class. 

For property D2), let F < E be separable and let F < K. Since every 
element of E is separable over F it is also separable over the larger field 
K. Hence EK = K(E) is separably generated and is therefore separable. 

The fact that separable extensions are closed under the taking of 
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arbitrary composites follows from the finitary property of arbitrary 
composites. That is, each element of an arbitrary composite involves 
elements from only a finite number of the fields in the composite and so 
is an element of a finite composite, which is separable. 

Finally, the normal closure E™ is the composite V(cE) for o € 
Homg(E,E). Since F < E is separable and o:E—cE is an isomorphism 
over F, the elements a€E and oa€cE have the same minimal . 
polynomial and so the separability of a over F implies that of ca, 
whence F <cE is separable. Since separability is preserved under 
composites, F < E™° is separable. & 


4.6 Pure Inseparability 
The antithesis of a separable element is a purely inseparable element. 


Definition An element a@ algebraic over F is purely inseparable over F if 
its minimal polynomial min(a,F) has the form (x — a)" for some n > 1. 
An algebraic extension F < E is purely inseparable if every element of E 
is purely inseparable over F. 0 


Note that any a € F is purely inseparable over F. In fact, an element 
a is both separable and purely inseparable over F if and only if a € F. 
It follows that, for extensions of fields of characteristic 0 or finite fields, 
there are no “interesting” purely inseparable elements. 


Example 4.6.1 Let char(F) = 2. If t is transcendental over F, then t is 
purely inseparable over F(t?), since its minimal polynomial over F(t?) is 
x? —t? =(x—t)*.0 


Example 4.6.2 Here we present an example of an element that is neither 
separable nor purely inseparable over a field F. Let char(F) = p and let 
a € F be nonzero. Let t be transcendental over F and let 
= iP 
5 Pra 


According to Theorem 3.3.6, F(s) < F(t) is algebraic and has degree 
equal to p*. Since t is a root of the monic polynomial 


2 
p(x) = xP —sxP —sa 


2 ; 
of degree xP over F(s), this must be the minimal polynomial for t over 
F(s). Since p(x) = q(x?), we deduce that t is not separable over F(s). 
On the other hand, if t were purely inseparable over F(s), there would 
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exist @ € F(s) for which 
xP’ — 5xP _sa = (x - py = xP pp’ 


which would imply that s = 0, which is not the case. Hence, t is neither 
separable nor purely inseparable over F(s). 0 


Definition Let F < E be finite. Since [E:F], | [E:F], we may write 
_ [E:F] = [E:F],[E:F], 


where [E:F]; is the inseparable degree or degree of inseparability of E 
over F. 9 


Note that, while the separable degree is defined for infinite 
extensions, the inseparable degree is defined only for finite extensions. 


Definition If F < E is algebraic and [E:F],=1, we say that F <E is 
degreewise purely inseparable. When F < E is finite, this is equivalent 
to (E:F]; = [E:F]. 0 

Theorem 4.6.1 Let F < E be a finite extension with expchar(F) = p. 


1) IfF <K<E then [E:F]; = [E:K],[K:F];. 

2) F< FE is separable if and only if [E:F]; = 1. 

3) Ifae€E then [F(a):F]; = p? where d is the radical exponent of a. 
4)  [E:F]; is a power of p. 


Proof. The first three statements are clear. The last statement follows 
from the fact that F < E is finitely generated and the inseparable degree 
is multiplicative. We leave the details to the reader. § 


We next characterize purely inseparable elements. 
Theorem 4.6.2 Let a be algebraic over F, with radical exponent d and 
let p(x) = min(a,F). The following are equivalent. 
1) ais purely inseparable over F. 
2) The polynomial (x — a)" has coefficients in F, for some n > 1. 
2: a) Sasa esl ar 
4) ais a root of xP B, for some @ € F and k > 0. 
5) a® € F for some k > 0. 


6) dis the smallest nonnegative integer for which af EF, 
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Proof. We establish only those implications that are not immediate. 
Recall that p(x) = q(x?) where q(x) is separable over F. 

[2=>3] If 2) holds then q(x") |(x—«@)" and so q(x") = (x-a)" for 
some 1 <r <n. It follows that r= mp°, where m = deg q(x). Hence, 


q(x®") = (x — a) ™* = (x oF) 


Thus q(x) = (x— ae) and the separability of q(x) implies that m = 1, 
whence , 
r = p® and p(x) = (x—a@)P 


(5=>6] If 5) holds then 
r(x) = PY Pt (x- a) 


is a polynomial over F with r(a) = 0. Hence q(x") | (x- a)", showing 
that k > d. Since r(x) € F[x], the fact that 2) implies 3) shows that 


p(x) = x" — oF? 


and so a™ € F. Hence d is the smallest integer for which af eF. 
[6=>1] If 6) holds, then 


r(x) =(x- a) = xP_ oF 


is a polynomial over F with a as a root, and so p(x) | r(x). Hence, p(x) 
has the form (x—a@)" for some n > 1 and a is purely inseparable over 
PF. 


Theorem 4.6.3 Let F < E be algebraic. The following are equivalent. 


1) E is purely inseparably generated; that is, generated by purely 
inseparable elements. 

2) EF <E is degreewise purely inseparable; that is, [E:F], = 1. 

3) F< E isa purely inseparable extension. 


Proof. [1=>2] Suppose first that E = F(I) where all elements of I are 
purely inseparable over F. Any embedding o:E-L over F is uniquely 
determined by its values on the elements of I. But if a EI then oa is a 
root of the minimal polynomial min(a,F) and so oa = a. Hence o must 
be the identity and [E:F], = 1. 

[2=>3] Let a € E. Then [F(q):F], = 1 and since F < : F(a) is a finite 
extension, Theorem 4.3.1 implies that 


= [F(a):F] = deg min(a,F) 
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d 
Since min(a,F) = q(x? ), it follows that q(x) is linear and so 
min(a,F) = ae B 


for some 8 €F, which implies by Theorem 4.6.2 that a@ is purely 
inseparable over F. [3=>1] This is clear. § 


We can now show that the class of purely inseparable extensions is 
distinguished. 


Theorem 4.6.4 The class of purely inseparable extensions is 
distinguished. It is also closed under the taking of arbitrary composites. 


Proof. Let F<K<E. Since pure inseparability is equivalent to 
degreewise pure inseparability and (E:F],= 1 if and only if [E:K],=1 
and [K:F], = 1, it is clear that D1) holds. For D2), suppose that F < E 
is purely inseparable and F < K. Since every element of E is purely 
inseparable over F, it is also purely inseparable over the larger field K. 
Hence EK = K(E) is purely inseparably generated and therefore purely 
inseparable. We leave proof of the last statement to the reader. § 


4.7 Separable and Purely Inseparable Closures 

Let F <E. According to Theorem 4.4.1, if a, @ EE are separable 
over F then F(a,() is separable over F. It follows that a+ 8, af, and 
a! (for aw #0) are separable over F. Hence, the set of all elements of E 
that are separable over F is a subfield of E. A similar statement holds 
for purely inseparable elements. 
Definition Let F < E. The field 

F*° = {a € E| @ separable over F} 
is called the separable closure of F in E. The field 
F'* = {a € E| qa is purely inseparably over F} 

is called the purely inseparable closure of F in E. 0 


The separable closure allows us to decompose an arbitrary algebraic 
extension into separable and purely inseparable parts. 
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Theorem 4.7.1 Let F < E be algebraic. 


1) In the tower F < F*° < E the first step is separable and the second 
step is purely inseparable. 


2) QE Fl C F®<, 
3) Any embedding o:E-—E is uniquely determined by its restriction 
to F*, 


Proof. For part 1), if a € E has radical exponent d, then min(a,F) = 
q(x’) where q(x) = min(a’,F) is separable and so a? € F**. Thus, 
Theorem 4.6.2 implies that a is purely inseparable over F*°. This shows 
that F°°<E is pusrely inseparable. For part 2), since pa = 
[F(a):F]; | [E:F]; we see that 


EF; € FS 


for all a € E and so perl; C F*°, We leave proof of the last statement 
to the reader. & 


Corollary 4.7.2 Let F < E be finite. Then [E:F], = [F°°:F] and [E:F]; = 
[E:F*‘]. 0 


Part 1 of Theorem 4.7.1 shows that any algebraic extension can be 
decomposed into a separable extension followed by a purely inseparable 
extension. In general, the reverse is not possible. Although F < F’* is 
purely inseparable, the elements of E — F'° need not be separable over 
F; they are simply not purely inseparable over F. However, it is not 
hard to see when F’° < E is separable. 


Theorem 4.7.3 Let F < E be algebraic. Then F° < E is separable if and 
only if E = F°°F"’’, 


Proof. If F'* < E is separable then so is FSF < FE. But since F© < E is 
purely inseparable, so is F scpic <E. Thus, we have E= FSFiC, 
Conversely, if E=F®°F© then F° < FS°F!S, being a lifting of a 
separable extension F < F*°, is also separable. 1 


We can do better than the previous theorem when F < E is a normal 
extension, which includes the case E = F. Let G = Autp(E) be the set of 
all automorphisms of E over F. Since F < E is normal, G is also the set 
of all embeddings of E into F over F. We define the fixed field of G in 
E by 


F(G) = {a € E| ca = @ for all o € G} 
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Theorem 4.7.4 Let F < E be a normal extension. Let G = Aut,(E) and 
let F(G) be the fixed field of G in E. Then F(G) = F'°, Furthermore, in 
the tower F<F'°<E, the first step is purely inseparable and the 
second step is separable. 


Proof. Let a € F(G). If @ EF is a root of p(x) = min(a,F) then there 
exists an embedding o:E—F over F for which oa = f. But oa =a and 
so 8 =a. Hence min(a,F) has only one root and so a € F**. On the 
other hand, if a€F° then any o €G must map a to itself, since it 
must map a to a root of min(a,F). Hence a € F(G). This proves that 
F(G) = F**, 

Now let a € E and p(x) = min(a,F(G)). Let. q(x) = [](x—1;) where 
R = {r,,...,t,} is the set of distinct roots of p(x) in E. Since any 7 €G 
is a permutation of R, we deduce that q?’(x) =q(x) and so the 
coefficients of q(x) lie in F(G). Hence q(x) = p(x) and a is separable 
over F(G). § 


Corollary 4.7.5 If F <E is normal then F*° < E is separable and E = 
Fscpic, o 


Let us conclude this section with a characterization of simple 
algebraic extensions. If E = F(a) is a simple algebraic extension of F 
and if d is the radical exponent of a, we have seen that p% = [E: F]; is 
the smallest nonnegative power of p such that a? is separable over F, or 
equivalently, such that EF CF**. It turns out that this property 
actually characterizes simple algebraic extensions. Before proving this, 
we give an example where this property fails to hold. 


Example 4.7.1 Let u and v be transcendental over K with char(K) = 
p #0. Let E = K(u,v) and F = K(uP,v?). It is easily seen that F < E is 
purely inseparable with [E:F], = p’. However, a € E implies aP € F and 
so EPCF.Q 


We next require the following useful lemma. 


Lemma 4.7.6 If char(F) = p # 0 and a€ F, a ¢ FP then f(x) = x” a is 
irreducible for every k > 1. 


Proof. Let 8 € F be a root of f(x) = x” _ @. Then 
f(x) = (x 6)" 
If p(x) = min(@,F) then p(x)|f(x) and so p(x) =(x— aye for some 


d<k. But ifd <k then 
pM er 
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and so 


k k-1 
a=pP =(gP )PeErFP 
contrary to assumption. Hence d = k and f(x) is irreducible. § 


Theorem 4.7.7 Let F <E be a finite extension with [E:F]; =p‘. Then 
F < E is simple if and only if d is the smallest nonnegative integer for 
which EP C F®, 

Proof. We have seen that if F < E is simple then d is the smallest such 
nonnegative integer. For the converse, note first that if F is a finite field 
then so is E, implying that E* is cyclic and so F < E is simple. Let us 
assume that F is an infinite field and look at the second step in the 
tower F< FS° < E. This step is purely inseparable. Since F*° < E is 
finite, we have 


E = F°°(6,,..++By) 


If for some k < d, we have pe € F® for all i, then EP C F*, contrary to 
hypothesis. Hence one of the ;’s, say (, satisfies 


BP CR, Bh ¢ FS fork <d 
It follows that 
[F°°(8):F*]; = pt = [E:F]; > [E:F°, 


Since F*°(@)<E, we have [F*(@):F*], =[E:F°]; and since the 
extensions involved are purely inseparable, we get [F°°(8):F*°] = [E:F*]. 
Hence, E = F*°(f). 

Our tower now has the form F < F*° < F°°(8) where # is purely 
inseparable over F*°. In addition, F < F® is finite and separable and 
therefore simple. Thus there exists a € F*° such that F°° = F(a) and the 
tower takes the form F < F(a) < F(a,@) where @ is separable over F 
and @ is purely inseparable over F(a). By Theorem 4.4.6, the extension 
F < F(a,Q) is simple. & 


Note that Theorem 4.7.7 implies that the extension F<E of 
Example 4.7.1 is not simple. 


4.8 Perfect Fields 


Definition A field F is perfect if every irreducible polynomial over F is 
separable. 0 
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It is clear from the definitions that if F is perfect then any algebraic 
extension of F is separable. Conversely, suppose that every algebraic 
extension of F is separable. If p(x) € F[x] is irreducible and a@ is a root 
of p(x) in some extension of F then F < F(a) is algebraic and so a is 
separable over F, that is, p(x) is separable. Thus, F is perfect. 


Theorem 4.8.1 A field F is perfect if and only if every algebraic 
extension of F is separable over F. 0 


Theorem 4.8.2 Every field of characteristic 0 and every finite field is 
perfect. 0 


Note that if expchar(F) = p then FP = {aP | a € F} is a subfield of F. 
The map ¢:F-—F defined by Opa = a? is called a Frobenius map. It is a 
monomorphism since aP + BP = (a+ f)?. 


Theorem 4.8.3 Let F be a field with expchar(F) = p. The following are 
equivalent. 


1)‘ F is perfect. 

2) F =F for some (and hence all) k > 1. 

3) The Frobenius map oo is an automorphism, for some (and hence 
all) k >1. 


Proof. [1=>2] Suppose F is perfect. Let a@€F and consider the 
polynomial p(x) = xP—a€ Fx]. If @ is a root of p(x) in a splitting 
field then GP = a and so 


p(x) = xP — 6 = (x-p)P 


Hence £ is purely inseparable over F. But ( is also separable over F and 
therefore 3 € F. Hence, a € FP for all a€ F, that is, F C FP. Since the 
reverse inclusion is manifest, we have F = FP. Lemma 4.1.2 implies the 
desired result. 

[21] We may assume that p>1. If 2) holds then Lemma 4.1.2 
implies that FP = F. It follows that if p(x) € F[x] is irreducible but not 
separable, then 


p(x) = D> a,(xP)! = So bP(yP = (} > b,x!)P 


contradicting the fact that p(x) is irreducible. Hence, p(x) is separable 
and so F is perfect. Since the Frobenius map is a monomorphism, 
statements 2) and 3) are easily seen to be equivalent. 


While it is true that any algebraic extension of a perfect field is 
perfect, not all subfields of a perfect field need be perfect. 
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Theorem 4.8.4 
1) If F <E is algebraic and F is perfect then E is perfect. 
2) If F < Eis finite and E is perfect then F is perfect. 


Proof. Part 1) follows from Theorem 4.8.1 and the fact that every 
algebraic extension of E is an algebraic extension of F. For part 2), let 
expchar(F) = p and suppose first that F < E is simple. Thus, E = F(a) 
is perfect and @ is algebraic over F, with minimal polynomial p(x) = 
>oax’. Then 
0= (Daal? = Dapar 

Hence, the degree of aP over FP is no greater than the degree of a over 
F, in symbols, [F?(a?):FP] <[F(a):F]. But F?(aP) = [F(a)]? = F(a) 
since F(a) is perfect and so [F(a@):F?] < [F(a):F]. Since FP < F, equality 
holds and FP = F, whence F is perfect. Since F < E is finitely generated 


by algebraic elements, the result follows by repetition of the previous 
argument. If 


Note that we cannot drop the finiteness condition in part 2) of the 
previous theorem since, for example, F < F is algebraic and F is perfect 
even if F is not. 


Perfect Closures 


Let char(F) = p #0 and let F be an algebraic closure of F. For each 
k > 1, the set 


FP — {ae Fla" € F} 
is a subfield of F. Moreover, we have 


FCFUP¢ FUP’ Cc Mg 
The union . 
iad k 
pel(F) = Ur 
k=1 


which is also a subfield of F, is known as the perfect closure of F in F, 
which name is justified by the following theorem. 


Theorem 4.8.5 Let F be a field of characteristic p #0. Then pcl(F) is 
the smallest perfect subfield of F containing F. 


Proof. To see that pcl(F) is perfect, observe that if a € pel(F) | then 
a” €F for some k > 1. Hence, letting § be a root of xP—a in F, we 
have a = GP, where 
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k+1 k 
BP =aP €F 


and so # € pcl(F). This shows that pcl(F) C [pcl(F)]?. Since the reverse 
inclusion is obvious, it follows that pcl(F) is perfect. 

In addition, if F < K < pcl(F) and a € pel(F) — K, the fact that a? € 
F for some k > 1 implies that a is purely inseparable over F and hence 
also over K. But since a is not in K, it cannot be separable over K as 
well. Thus K is not perfect. 5 


Exercises 

1 Lett F< K<E. If F <E is separable, show that F < K and K < E 
are separable. 

2. Prove that if F <E is finite and separable then there are only 
finitely many intermediate fields between E and F. 

3. Show that all algebraically closed fields are perfect. If t is 
transcendental over F then F(t) is not perfect. 

4. Let @ be algebraic over F, where expchar(F) =p and let d be the 
radical exponent of a. Show that a? is separable over F if and 
only if k >d. 

5. Let p and q be distinct primes. Then Q < QP, 4 ) is finite da 
separable and therefore simple. Describe an infinite class of 
primitive elements for this extension. Find the minimal 
polynomial for each primitive element. 

6. Let E=F(a,,...,a@,,) be separable over an infinite field F. Prove 
that there are an infinite number of n-tuples (a,,...,a,) € F” for 
which E = F(aja, +++: +a,q, 

7. Show that the class of purely inseparable extensions is closed 
under the taking of arbitrary composites. 

8. Let F<E. Define the purely inseparable closure of F in E and 
show that it is a field. 

9. If F <E is algebraic prove that any embedding o:E—E is uniquely 
determined by its restriction to F°°(E). 

10. Prove that if F < E is finite and expchar(F) =p then [E:F]; is a 
power of p. 

11. Show that lifting an extension by a purely inseparable extension 
does ‘not affect the separable degree. That is, show that if F < E is 
algebraic and F < P is purely inseparable then [EP:P], = [E:F].,. 

12. Let F<S be finite separable and F<P be finite purely 
inseparable. Prove that P < SP is separable and [SP:P] = [S:F]. In 
fact, if B is a basis for S over F, prove that it is also a basis for SP 
over P. 

13. Show that if F< E is finite and F <S is finite separable then 
[ES:S]; = [E:F];. 
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14. 


15. 


16. 
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Let F < E be a finite extension and let a € E be algebraic over F. 
Let H be the set of embeddings of E into E over F. The elements 
of H permute the roots of p(x) = min(a,F). Let 6B be a root of 
p(x). Show that |{o€H|ca=}| =[E:F(a)],. Hence, the 
multiset {oa|o€H} contains [E:F(a)], copies of each root of 
p(x). 

Let F < E be a finite extension that is not separable. Show that 
for each n > 1 there exists a subfield E,, of E for which E, < E is 
purely inseparable and [E:E,]; = p™. 

Prove that if F # pel(F) then the extension F < pcl(F) is infinite. 


Part 2 
Galois Theory 


Chapter 5 
Galois Theory I 


5.1 Galois Connections 


The traditional Galois correspondence between intermediate fields 
and subgroups of the Galois group is one of the main themes of this 
book. We choose to approach this theme through a more general 
concept, however. 


Definition Let P and Q be partially ordered sets. A Galois connection 
on the pair (P,Q) is a pair (II,Q) of maps I:P-+Q and 02:Q—P, where 
we write II(p) = p* and Q(q) =q’, with the following properties: 


1) (order reversing) For all p€ P, qEQ, 
p<q=>p*>q* and r<s>r'>s' 


2) ForallpeP,qeEQ, 
psp” and q<q” O 


Theorem 5.1.1 For any p € P and q € Q, we have 


i): pet, 
2) q’*' _— q’. 


Proof. Since p < p*’, the order reversing property of * gives 
p** < p* < (p*)’* 


from which part 1) follows. Part 2) is similar. & 


102 5 Galois Theory I 


Corollary 5.1.2 The map p—p”’ is a closure operation on P, that is, if 
we denote p*’ by ci(p), then for all p € P, q€ Q, 


1) (Extensive) 
p< clip) 


2) (Idempotent) 
el(el(p)) = el(p) 


3)  (Isotone) 
pP<q = cl(p) < cl(q) 


Similarly, the map qq” is a closure operation on Q. 0 


Definition An element p€P is said to be closed if ci(p)=p, and 
similarly for Q. We denote the set of all closed elements in P by Ci(P), 
and similarly for Q. 0 


Theorem 5.1.3 The image of any element under II or 2 is closed. In 
addition, the maps II and {2 are order—reversing bijective inverse maps 


between the sets Cl(P) and Cl(Q). 


Proof. Theorem 5.1.1 shows that the image of an element under II or Q 
is closed. Moreover, if q € C\(Q) is closed, then q’ € CP) and IIq’ = 
q’* = cq) = q is in the image of II, and so II maps CP) onto CQ). If 
p, r€ CUP) and p*=r"*, then p*’=r*’, that is, p=r. Hence II is 
injective. A similar argument applies to 2. Finally, since p*’ =p for 
p € Cl(P), it follows that QoII =e on CKP) and similarly, Io =. on 
CQ). # 


Theorem 5.1.4 Let II:P-Q and 2:Q-P be a Galois connection, where 
P and Q are lattices. 


1) If p,j€ CXP) and Ap, exists in P, then Ap,€ CUP). If P is a 
complete lattice then so is Ci(P), with meet given by meet in P. 
Similar statements hold for Q. 

2) De Morgan’s Laws hold in CP) and CQ). That is, for p, 
q € Cl(P) and r, s € Cl(Q), we have 


(pAq)* =p" Vq", (pVq)" =p" Aq" 
(rAs)'=r'Vs’, (rVs)'=r' As’ 


Proof. For part 1), suppose that p; € Cl(P) and Ap, exists as a meet in 
P. Since Ap; <p; for all j, we have cl Ap;) < cl(p;) = Pj, whence 
cl Ap;) < Ap;. Since the reverse inequality holds as well, we have 
equality, whence Ap; € Ci(P). It follows from Theorem 0.1.1 that if P 
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is a complete lattice, so is CP), under meet in P. 

For part 2), observe first that pAq<p and pAq<q imply that 
(p Aq)* > p* and (pAq)* > q*, whence (pAq)* > p* Vq*. If r > p* and 
r>q"* for r€ CP) then r’<p and r'<q, whence r'<pAq. Thus, 
t>(pAq)*. It follows by definition of join that (p Aq)* = p* Vq*. The 
other parts of De Morgan’s Laws are proved similarly. 


Let Z* denote the set of positive integers. 


Definition We will say that a Galois connection (1,2) on (P,Q) is 
indexed if the following hold. For each p, q € P with p <q, there exists 
a number (q:p) € Zt U {oo}, called the degree of q over p. Similarly, for 
each r, s€ Q with r<s, there exists a number (s:r) € Zt U {oo}, called 
the degree of s over r. Moreover, the following properties hold. 


1) (Degree is multiplicative) If s,, s,, 3 € P or s,, 89, 83 € Q then 
81 <8q < 83 => (83:5,) = (S3'8q)(89:8,) 
2) (and OQ are degree-nonincreasing) If p, q € P then 


p <q = (p":q") < (a:p) 
If r,s € Q then 
r<s => (r':s’) < (s:r) 
3) Ifs,te€Pors, t€Q then 
(st) =l=>s=t 


If (s:t) < oo, then s is said to be a finite extension of t. (We observe 
some obvious understandings about oo; for instance, n < oo for all n€ 
Zt, 00 < 00, n+00 = o forn € Zt and «0 <k < implies k = 0.) O 


From now on, when writing (p:q), it is with the tacit assumption 
that p<q. While II and © are degree-nonincreasing in general, these 
maps are degree preserving when restricted to Cl(P) and Cl(Q), as we 
now show. 


Theorem 5.1.5 Let (11,2) be an indexed Galois connection on (P,Q). 


1) If p,q € CKP) and p <q then (q:p) = (p*:q"). 
2) If pe CUP) and (q:p) < oo then q€ Ci(P). In particular, if 0 is 
closed and (1:0) is finite then all elements are closed. 


Similar statements hold for Q. 
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Proof. If p € Cl(P) then p = p*’ and so 
(a:p) 2 (p*:a*) 2 (a":p™) = (a":p) = (q":q)(a:p) 


If qe CP) then q*’=q, equality holds throughout and part 1) is 
proved. If (q:p) is finite then we may cancel to get (q*’:q) =1, which 
implies that q = q”’ is closed. This proves part 2). § 


Thus, an indexed Galois connection induces a degree-preserving, 
order-reversing bijection between Cli(P) and Cl(Q). 


5.2 The Galois Correspondence 
Now we describe one of the most important Galois connections. 


Definition The Galois group of an extension F < E, denoted by G,(E), 
is the group of all automorphisms of E over F. 0 


Note that when F<E is algebraic, Theorem 2.8.2 implies that 
Gp(E) = Hom,(E,E). 

Let F < E and let ¥ be the complete lattice of all intermediate fields, 
that is, fields K such that F < K < E, ordered by set inclusion. Let G be 
the complete lattice of all subgroups of the Galois group G,(E), ordered 
by set inclusion. We define two maps II:¥-G and 0:g—-F by 


II(K) = Gx(E) 
Q(H) = F(H) = {a € E| ca = cs for all o € H} 


and 


where F(H) is the fixed field of H. 


Theorem 5.2.1 Let F<E. The pair of maps (II,Q2) defined by 
II:Kr+G_(E) and 2:H+F(H) is a Galois connection. We refer to it as 
the Galois correspondence of F < E. 


Proof. It is clear from the definitions that 
K CJ => G,(E) C G,(E) 
and 


H CI => F(I) C F(H) 


Also, any element of K is fixed by every element of Gy(E), that is, 
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KC F(Gx(E)) 
Finally, any o € J fixes every element in F(J), that is, 
JCG F( (FE) | 
Since ¥ and § are complete lattices, Theorem 5.1.4 gives 


Corollary 5.2.2 The set Cl(¥) of closed intermediate fields and the set 
C\(G) of closed subgroups of Gp(E) are complete lattices, where meet is 
intersection. In particular, the intersection of closed intermediate fields 
is closed and the intersection of closed subgroups is closed. 0) 


We would like to show that the Galois correspondence of an 
algebraic extension F < E is indexed, where (K:L) = [K:L] is the degree 
of the extension F < E and (H:J) is the index of the subgroup J in the 
group H. It is not hard to see that these degrees satisfy the first and 
third properties in the definition of an indexed correspondence. The 
next theorem shows that the map II:K++G,,(E) is degree-nonincreasing. 


Theorem 5.2.3 Let F < E be algebraic and let F < L < K < E. Then 
(5.2.1) (Gi (E):Gx(B)) < [KL], < [KL] 


1) If F<E is normal, then equality holds in the first inequality in 
(5.2.1) and the map ~:G,(E)—Hom,(K,E) defined by Yo =o |, 
induces a bijection 

G,(E) 
Gy(E) 


«+ Hom, (K,E) 


2) If F<E is both normal and separable, then equality holds 
throughout (5.2.1). 


Proof. If o, r € G,(E), the following are equivalent 


olk=TIk 
toa = a, for all ae K 
t 10 € Gy(E) 
o €TG,(E) 
Thus, Yo = yr if and only if o and 7 lie in the same coset of G,(E). 
Hence y induces a bijection from the set of cosets of Gy(E) in G,(E) 
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onto Im(). Since 
Im(p) € Hom, (K,E) C Hom, (K,E) 


we get 


(G,(E):Gx(E)) = | Im(¥) | < | Hom, (K,E) | 
< | Hom,(K,E) | = [K:L], 


which proves the first part of the theorem. 

To prove part 1), suppose that F < E is normal. Then L < E is also 
normal. If o € Hom,(K,E) then o can be extended to ¢ € Hom,(E,E) = 
G,(E). It follows that o maps K into E, whence o € Hom, (K,E) and so 
Hom,(K,E) = Hom,(K,E). Moreover, since any o € Hom, (K,E) can be 
extended to G€G,(E) and since o= yo € Im(y), it follows that 
Im() = Hom,(K,E) and so equality holds in both inequalities in the 
previous display. This proves part 1). Part 2) is clear. § 


To show that the map H+F(H) is degree-nonincreasing, we require a 
preliminary result. 


Theorem 5.2.4 Let F < E. Let H C Gp(E). For a € E, define @:H-E by 
ao =oa (thus, @ is evaluation at a). Then a,...,o, are linearly 
independent over F(H) if and only if @,,...,@, are linearly independent 
over E. 


Proof. Suppose that the @;’s are independent over E, and let }) ao; = 0 
where a; € F(H). Then for any o € H, 


0= (x a;;) = Vai(oa;) = Lai(ae) 


Hence })a,@, = 0, implying that a; = 0 for all i. 

For the converse, suppose that a,,...,a, are independent over F(H) 
and let }°x,4,=0 on H. If this equation has a nonzero solution 
X4,-..,X, in E, consider a solution with the fewest number of nonzero 
entries and, by renumbering if necessary, assume the nonzero entries to 
be x,,...,x,. Dividing by x, if necessary, we may also assume that 
x, = 1. Thus 


(5.2.2) X,@, + bio + X,-1% 1 + a, => 0 
Equation (5.2.2) is equivalent to 


(5.2.3) X,(oa,) +--+ +x,_)(ca,_,) toa, =0 
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for all o € H. In particular, for o equal to the identity map, we get 
XyQy Hee +X 10,1 +O, = 0 
which implies, owing to the independence of the a,’s, that not all of the 
x;’s can lie in F(H). Let us assume that x, ¢ F(H). 
Applying r € H to (5.2.3) gives 
7(x,)(T7a1) +++++7(x,_1)(Tea,_1) + Toa, = 0 
for all o € H. But as o varies over the subgroup H so does ro and so 
7(X)(oa,) +++ +7(x,_1)(7a,_1) +00, =0 
for all o,r € H, or equivalently, 
(5.2.4) (7x, )@, +--+ + (7x,_1)@,_1 +@, =0 


Since x, ¢ F(H), we may choose 7 €H such that 7(x,) # X,. Finally, 
subtracting (5.2.2) from (5.2.4) gives 


[(7x1) — x], +--+ + [(7@,_1) —x,_1]@,_, = 0 
which is shorter than (5.2.2). This contradiction completes the proof. ll 
Now we can show that the map H++F(H) is degree-nonincreasing. 


Theorem 5.2.5 Let F < E be algebraic and let H and J be subgroups of 


[F():F(H)] < (13) 
Proof. If (H:J) = oo there is nothing to prove, so let (H:J) =1< oo. 
Choose one o, from each coset of J in H, for i= 1,...,r. Let 
@4,...,@, € F(J) be linearly independent over F(H) and assume for the 


purposes of contradiction that n >r. The system 


Xy(@ 01) + X2(@201) + +++ +x, (4,01) = 0 


X,(@0,) + X9(@0,) Baa Xn(@y7,) =0 


has more unknowns than equations and so it has a nonzero solution 
X1,-..,X,, in E. Hence, there exist B,,.++58, € E, not all 0, such that 
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(5.2.5) By(@0;) +++ + B,(@,7;) = 0 


for all i=1,...,r. 
Now, any 7 €H has the form r = o;p where p € J. Since a; € F(J), 
we have pa; = a; and so 


ar = a;(o;p) aaa (0;p)(95) = O50; 7 ajo; 


Hence, it follows from (5.2.5) that 
Bye, eee Ys =0 


as a map on H. This contradicts the previous theorem, which says that 
the @;’s are independent over E. Hence n <r. 8 


Thus, the Galois correspondence of an algebraic extension F < E is 
indexed. As a consequence, we have the following theorem. 


Theorem 5.2.6 Let F <E be algebraic and let (11,2) be the Galois 
correspondence of F < E. Then (II,Q) is indexed. Hence 


1) [and QQ are degree-nonincreasing, order-reversing maps. 

2) II and Q are degree-preserving, order-reversing bijections (inverses 
of each other) between the lattice Ci(F) of closed intermediate 
fields of F <E and the lattice Cl(G) of closed subgroups of the 
Galois group Gp(E). More specifically, 

a) IfF <L<K<E with K, L closed then 


[K:L] = (G,(E):Gx(E)) 
b) IfJ CHC G,(E) with H, J closed then 
(H:J) = [F(J):F(H)] 


c) For K, L € CXF) and H, J € Cl(Q), we have 


Gx aL(E) = GK(E) V G(E), Gx yL(E) = Gx(E)N G,(E) 
F(HnJ) = F(H) V FJ), F(H.AJ) = F(H) F(J) 


In addition, any finite extension of a closed intermediate field or closed 
subgroup is closed. 0) 


We should note that the joins in part 2c) of the previous theorem are 
joins in the corresponding lattices. Thus, for instance, Gy(E) V G,(E) is 
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the smallest closed subgroup of G,(E) containing G,(E) and G,(E), 
which need not be the smallest subgroup of Gp(E) containing these 
groups. (In other words, Ci(¥) need not be a sublattice of ¥ and Ci(G) 
need not be a sublattice of §.) 


Corollary 5.2.7 If F < E is finite then | Gp(E) | < 0.0 


5.3 Who’s Closed? 


We turn our attention to the question of which intermediate fields of 
an extension and which subgroups of the Galois group are closed. 


Definition A normal separable extension is called a Galois extension. (] 


The next theorem follows from the relevant properties of normal and 
separable extensions. 


Theorem 5.3.1 

1) LetF<K<E. If F <E is Galois then K < E is Galois. 

2) The class of Galois extensions is closed under lifting: If F < E is 
Galois and F < K then K < EK is Galois. 

3) The class of Galois extensions is closed under arbitrary composites 
and intersections: If F < E, are Galois and VE, is defined then 
F < VE, is Galois and F < ()E; is Galois. 0 


It is not hard to describe the closed intermediate fields of an 
algebraic extension F < E. 


Theorem 5.3.2 Let F<E be algebraic and consider the Galois 
correspondence on F < E. 


1) An intermediate field K is closed if and only if K < E is a Galois 
extension. 
2) If K is closed and K < L << E then L is also closed. 
3) The following are equivalent. 
a) F is closed. 
b) F< Eis a Galois extension. 
c) All intermediate fields are closed. 


Proof. According to Theorem 4.7.4, if K <E is normal then ci(K) = 
F(G,(E)) = K"*, the purely inseparable closure of K in E. Hence, if 
K <E is Galois then cl(K) = K. For the converse, suppose that K is 
closed. Let a € E with p(x) = min(a,K) of degree n. Since [K(a):K] is 
finite, we know that K(q) is closed and 
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n= [K(a):K] = (G(E):Gx(4)(B)) 


Let o,,...,0,, be a complete set of coset representatives of GK(a) (E) in 
G,(E). It is easy to see that 7 Eo; iPK(a) (E) if and only if ae = 0a. 
Hence there are precisely n distinct images of a under the Galois group 
Gx(E). But each of these images is a root of the minimal polynomial 
p(x) and so p(x) is separable with all of its roots in E. Hence K < E is 
both separable and normal. This proves statement 1) and shows that 
3a) and 3b) are equivalent. All of the statements in 3) are equivalent 
since F < E is Galois if and only if K < E is Galois for all intermediate 
fields K. Similarly, 2) follows from 1). & 


Note that if F<E is algebraic then E is closed since E = F((e)) 
where 4 € G,(E) is the identity. 
If K is a closed intermediate field, then 


[E:K] = (G(E):Gg(E)) = | Gx(E) | 
In the finite case, the converse also holds. 


Theorem 5.3.3 Let F < E be a finite extension. 


1) An intermediate field K is closed if and only if [E:K] = | G,(E)|. 
2) The following are equivalent. 

a) F< E is Galois. 

c) F is closed. 

d) All intermediate fields are closed. 

e) [E:K] = | G,(E)| for all intermediate fields K. 

f) [E:F] = | Gp(E) |. 
Proof. We have seen that K closed implies [E:K] = | G,(E)|. 
Conversely, if [E:K] = | Gx(E)| then 


[E:K] = (G,(E):Gg(E)) = [F(Gg(E)):¥(Gq(E))] = [E-F(G,(E))] 


and so the finiteness of F < E implies that K = F(G,(E)), that is, K is 
closed. Part 2) follows from the previous theorem. # 


As for the matter of which subgroups are closed, let F< E be 
algebraic. Since the trivial subgroup Gp(E) = (¢) is closed, any finite 
subgroup of G,(E) is closed. Thus, if F < E is finite then Gp(E) is finite 
and all subgroups are closed. We may now give a complete answer to 
the question of who’s closed in the finite case. 


5 Galois Theory I 111 


Theorem 5.3.4 If F < E is finite then all subgroups of the Galois group 
Gp(E) are closed and an intermediate field K is closed if and only if 
K <E is a Galois extension. In particular, if F < E is Galois then all 
intermediate fields are closed. 0 


As the next example shows, in the general algebraic case, not all 
subgroups need be closed. 


Example 5.3.1 For this example, we borrow from a later chapter the 
fact that for any prime power p’, there exists a finite field GF(p*) of 
size p* and GF(p%) < GF(p*) if and only if d|r. 

Let F = Z, and let. E= F be an algebraic closure of F. Since F is a 
finite field, it is perfect and so F<E is separable. Since E is 
algebraically closed, F < E is normal. Hence F < E is a Galois extension 
and therefore F is closed. Let H=(¢,) be the subgroup of Gp(E) 
generated by the Frobenius map ¢,:a—a?. The fixed field F(H) is the 
set of all a € E for which aP =a, in other words, the roots in E of the 
polynomial p(x) = xP —x. But p(x) has p roots in F and so F(H) =F. 
It follows that 


cH) = Gruy(E) = Gp(E) 


Hence, all we need do is show that H # Gp(E) to conclude that H is not 
closed. 
Let q be a prime and consider the field 


P = GF(p3) U GF(p?) U GF(p* ) U= 


Then P is a proper subfield of E, since it does not contain, for instance, 
the subfield GF(p%+!). Hence [E:P] > 1 and since P <E is Galois, the 
group Gp(E) is not trivial. Let o € Gp(E). If o €H then o = ok for 
some k and so 


F((o)) = {a € E| oka = a} = {a CE| a” = a} 


is the set of roots in E of the polynomial x" — x. Hence F((c)) is finite. 
But F((c)) contains the infinite set P. This contradiction implies that 
o ¢H and so H # G,(E). 0 


The Galois correspondence begins with a field extension F < E and 
the corresponding Galois group G(E). We may also begin with a field 
E and a group G of automorphisms of E. Then we can form the fixed 
field 

F(G) = {a €E| oa = @ for all o € G} 
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and consider the Galois correspondence (I1,2) on F(G) < E. Since G is a 
subgroup of Gj,q)(E), it is in the domain of the map Q and so F(G) = 
Q(G). Hence, NC) is closed under the Galois correspondence and so 
F(G) <E is a Galois extension. If G is closed, which happens, for 
instance, when G is finite, then G F(a) (E) =G. 


Theorem 5.3.5 Let E be a field and let G be a group of automorphisms 
of E. Then the extension F(G)<E is Galois. If G is closed (for 
example, if G is finite) then G = Grqy(E)- 0 


5.4 Normal Subgroups and Normal Extensions 


If F <E is normal and K is an intermediate field, we know that 
K <E is also normal, but F < K need not be. However, we can neatly 
describe when F < K is normal in terms of Galois groups. (This is an 
example of the power and purpose of Galois theory.) 

Suppose first that F <K is normal (F <E need not be normal). 
Since any T € Gp(E) sends K onto itself, it follows that ror € Gy(E) 
for any o € G,(E), that is, Gy(E) is a normal subgroup of Gp(E), in 
symbols, Gy(E) « Gp(E). 

Conversely, suppose that Gy(E)<4Gp(E). We want to show that 
F < K is normal. Let a € K have minimal polynomial p(x) over F. If 8 
is any other root of p(x), then Theorem 2.8.4 implies the existence of a 
t € Hom,(E,E) such that ra = f. If F <E is normal, then r € Gp(E). 
If o € Gy(E), the normality of G,(E) implies that or = To’ for some 
o' € G,(E) and so 


oB=oTa=T0'a=Ta=Pf 


Thus, o fixes all of the roots of p(x) and so all of the roots of p(x) lie in 
F(G,(E)). If K is closed, then all of the roots of p(x) lie in K and so K 
is normal over F. We have proven most of the following. 


Theorem 5.4.1 Let F< K < E. 


1) If F <K is normal then G,(E) 4 Gp(E). 

2) If F<E is normal, K<E is Galois and Gy(E)4Gp(E) then 
F < K is normal. 

3) If F<E is Galois then F<K is normal if. and only if 
Gx (E) 4 Gp(E). 


Moreover, if F <K and F <E are normal, the map ¥:G,(E)—>G,(K) 
defined by 
poo | 
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is an epimorphism whose kernel is Gy(E). Thus, 


Gp(K) ~ hs 


Proof. We need only prove the last statement. Let o € Gp(E). Since 
F <K is normal, the restriction o |, being an embedding of K into E 
over F, is an automorphism of K and thus lies in Gp(K). Hence # maps 
Gp(E) to Gp(K). Moreover, for 7, rT € Gp(E), we have 


(or) lk =o(7 1K) = (71 KT Ik) 


which shows that ~ is a group homomorphism. The kernel of 7 is 
Gy(E) since if o€ Gp(E) then o|y =« if and only if o € Gy(E). 
Finally, the map yw is surjective since the normality of F < E implies 
that any o € Gp(K) can be extended to an element of G;(E), whose 
restriction to K is o. § 


5.5 More on Galois Groups 


We now examine the behavior of Galois groups under lifting and 
under the taking of composites. We assume that all composites 
mentioned are defined. 


Theorem 5.5.1 (The Galois group of a lifting) Let F < E be Galois and 
let F<K. Then K<EK is Galois. Moreover, the restriction map 
p: Gy (EK)— Gy 4 p(E) defined by Yo = | 5 is an isomorphism. Thus 


Gy(EK) ~ Gx a p(E) 
In addition, 


1) KNE=F implies Gg(EK) ~ Gp(E). 
2) If F <Eis finite, then G,(EK) © Gp(E) implies KN E = F. 


Proof. We have already seen that K < EK is Galois. The normality of 
F <E implies that y is a homomorphism from Gy(EK) into Gy ,p(E). 
If o € Gy(EK) and o |; = then o fixes E as well as K and so it fixes 
all elements of EK, whence o =1. Thus > is injective. It remains to 
show that Imy = Gy qp(E). 

To avoid confusion, let us use the notation Fp(-) for the fixed field 
with respect to the Galois correspondence on F < E, and Fpy(-) for the 
fixed field with respect to the Galois correspondence on K < EK. Since 
K < EK is Galois, we deduce that K is closed with respect to the Galois 
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correspondence on K < EK and so 


Fy(Im 4) = {a € E| ra =a for all r € Im(¥)} 
= {a€E| (o|,)a =a for all o € Gy(EK)} 
= {a€E| oa =a for all o € Gy(EK)} 
= EN Fex(Gx(EK)) 
=ENK 


Now, if we show that Imyw is closed with respect to the Galois 
correspondence on F < E, it follows by taking Galois groups that 


Im $= Gx nR(E) 


and thus 7 is surjective, completing the proof. If F < E is finite, then all 
subgroups of the Galois group G,(E) are closed, and we are finished. 
We will postpone the proof in the infinite case until we have discussed 
the Krull topology, later in this chapter. 

Finally, statement 1) is clear. As to statement 2), we have 
Gp(E) ~ Gg pp(E) and since Gy, p(E) < Gp(E) with both finite, we 
deduce that Gy ,p(E) = G,(E), whence KNE=F follows by taking 
fixed fields. 


Theorem 5.5.1 yields a plethora of useful statements about degrees, 
all of which can be read from Figure 5.5.1. We leave details of the proof 
to the reader. [Part 3) of the next result is particularly useful.] 


Figure 5.5.1 


Corollary 5.5.2 Suppose that F < E is finite Galois and F < K, with EK 
defined. Then 


1)  [EK:K] =[E:ENK] and [EK:K] | [E:F]. 
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If F < K is also finite then 
2) (EK:F] = [E:EN K][K:F] and [EK:F] | [E:F][K:F]. 
3) [EK:F] = [E:F][K:F] if and only if ENK =F. 


More generally, if F < E; is finite Galois for i=1,...,.n—1 and F< E, 
is finite then 


n 
4) (Ey -E,:F] = | [ (EB, (Ej41---E)) 
i=l 
where Eiyy --E,, = F when i=n. 


n 
5) (E,--E,:F) = [[[E;F] if and only if E, 9 (E,,,---E,) = F for alli, 
i=1 
where BE; ,4°--E, = F wheni=n. 0 


We now turn to the Galois group of a composite. 


Theorem 5.5.3 (The Galois group of a composite) Let ¥ = {E; |i € I} 
be a family of fields, all contained in a larger field. If F < E; is Galois 
over F for all i€I, then the composite VE, is Galois over F. If G = 
I] Gp(E;) is the direct product of the Galois groups G,(E;) and if 
1;:;G— Gp(E;) is projection onto the i-th coordinate, then the map 


$:Gp( V E;)> T] Gp(E;) 
defined by 
7(vo) =o|p, 


is a monomorphism of groups. Hence, Gp(VE;) is isomorphic to a 
subgroup of the direct product |] Gp(E;). 

Moreover, if ¥ = {E,,...,E,} is a finite family of finite extensions, 
then the following are equivalent 
1) is an isomorphism and 


Gp(Ey Mery. E,) > Gp(E,) Xerox Gp(E,) 


2) EB, A(Ej4,°-E,) =F for alli=1,...,n. 


Proof. Let K= VE,. We have already seen that F < K is Galois. Let 
o € G,(K). Since each F < E, is normal, we have o |p, € Gp(E;). If 7 € 
Gp(K) then : 


m(¥(0T)) = (or) lg, = (7 Leg) = mde) (or) = mL(Yo)(¥7)] 


and so (or) = (yo)(~r). Thus, ~ is a homomorphism of groups. If 
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o\|,, =+ for all i EI, then since each element of K is a rational function 
(ovet F) in finitely many elements of UE;, we must have o =z, whence 
p is injective. 

When F is a finite family of finite extensions, all Galois groups are 
finite. It follows that 


|Imyp| = | Gp(VE,)| =[VE;:F] 
and 
| TT Gp(E;) | = TT | Gp(E) | = TE: F] 


Hence y is surjective if and only if [VE;:F] = [][E,:F] and Corollary 
5.5.2 gives the desired result. § 


The following corollary will prove useful. 


Corollary 5.5.4 Suppose that F <E is a finite Galois extension with 
Galois group of the form 


G = G,(E) = G,x:--xG 


n 


If 
H, = Gy x+++ x {u} x++°xG, 


where {z} is in the i-th coordinate and if E, = F(H;) then 


1) F<E; is Galois with Galois group Gp(E;) ~ G;, 
2) E=E,V+-VE,, 
3) E,N(E,4,°:-E,) =F for alli=1,...,n. 


Proof. Since F < E is finite and Galois, all intermediate fields and all 
subgroups of G are closed. Since H; 4G, it follows from Theorem 5.4.1 
that F < E; is a Galois extension and 


In addition, F < VE; is Galois. Since 


Gyp(E) = 1 Ge(E) = 1H, = {+} = p(B) 


taking fixed fields gives VE;=E. Hence, Gp( VE,) ~ [] Gp(E;) and 
Theorem 5.5.3 implies that E;(E,,,---E,) =F for alli=1,...,n. 8 


5 Galois Theory I 117 


Abelian and Cyclic Extensions 


Extensions are often named after their Galois groups. Here is a very 
important example. 


Definition An extension F < E is abelian if it is Galois and if the Galois 
group Gp(E) is abelian. An extension F < E is cyclic if it is Galois and 
if the Galois group Gp(E) is cyclic. 0 


The basic properties of abelian and cyclic extensions are given in the 
next theorem, whose proof is left as an exercise. 


Theorem 5.5.5 


1) If F <E and F < K are abelian, then F < EK is abelian. 

2) If F<E is abelian (cyclic) and F < K, then K < EK is abelian 
(cyclic). 

3) If F<K<E with F <E abelian (cyclic), then F< K and K<E 
are abelian (cyclic). 0 


*5.6 Linear Disjointness 


If F <K and F <L are finite extensions, the degree [KL:F] provides 
a certain measure of the “independence” of the extensions. Assuming 
that [K:F] < [L:F], we have 


[L:F] < [KL:F] < [L:F][K:F] 


The “least” amount of independence occurs when [KL:F] = [L:F], or 
equivalently, when K <L and the “greatest” amount of independence 
occurs when 


(5.6.1) [KL:F] = [K:F][L:F] 


We have seen (Corollary 5.5.2) that, if one of the extensions is Galois, 
then (5.6.1) holds if and only if KNL=F. For finite extensions in 
general, we cannot make such a simple statement. However, we can 
‘express (5.6.1) in a variety of useful ways. For instance, (5.6.1) holds 
for arbitrary finite extensions if and only if whenever {a;} CK is 
linearly independent over F and {§;} CL is independent over F then 
{a; B; } is also independent over F. 

To explore the situation more fully (and for not necessarily finite 
extensions), it is convenient to employ tensor products. (All that is 
needed about tensor products is contained in Chapter 0.) 
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Let F <K and F <L. The multiplication map o:K x L—KL defined 
by o(a,8) =a is bilinear and so there exists a unique linear map 
v:K @ L>KL for which (a ® 8) = af. (The tensor product is over F.) 
This map is a morphism of F-algebras, since 


$l(a @ B)(7 @ 6)] = Yay @ 5) = a7f5 = (af)(75) = Y(a ® B)V(7 @ 6) 


Note that the image of ~ is the F-algebra K[L] = L[K] of all elements 
of the form 
k,é, tee tke, 


for k, € K and é; € L. Hence, if F < K or F < L is algebraic, then KL = 
K[L] and so the map 7 is surjective. 

If F is a field, we use the term F-independent to mean linearly 
independent over F. 


Theorem 5.6.1 Let F < E and suppose that K and L are intermediate 
fields. The following are equivalent. 


1) The linear map yp defined above is injective. 

2) If {a,;} C K is F-independent then it is also L-independent. 

3) If {o,} CK and {8;} CL are both F-independent then {a,G;} is 
also F-independent. 

4) If {a;} is a basis for K over F and {;} is a basis for L over F 
then {a;G;} is a basis for K[L] over F. 

5) There is a basis for K over F that is L-independent. 


If F < K and F < L are finite, then each of 1) to 5) is equivalent to 
6)  [KL:F] = [K:F][L:F]. 


If F < K and F <L are finite and one is Galois, then each of 1) to 6) is 
equivalent to 


7) KNOL=F. 


Proof. [1=>2] Let {a;}CK be F-independent and suppose that 
> 4a; = 0 for &; € L. Since y is a monomorphism and 


(226 8a) = Vi &a;, =0 
we have 


48a; =0 


Theorem 0.8.2 now implies that €; = 0 for all i. 
[2=>3] Let {a;} and {9;} be F-independent. If 
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do aieiFj =0 
iJ 


with a;,€ F then since {a;} is also L-independent, the coefficients of a; 


must equal 0, that is, 
a es 
j 


for all i. Since the (;’s are also F-independent, we get a; = 0 for all i, 

[34] This follows from the fact that if {a;} spans K over F nd 
{6;} spans L over F then {o;8;} spans K[L] over F. 

{4=51] The map 7 sends a basis {a; ® 8;} for K @L to a basis {a;f;} 
for K[L] and is therefore injective. 

Thus, each of 1) to 4) is equivalent, and by symmetry we may add 
the equivalent statement that any F-independent subset of L is also K- 
independent. It is clear that 2) implies 5). 

[52] Let {a;} be a basis for K over F that is L-independent. Let 
{8;} be an F-independent subset of L. We show that {(.} is also K- 

independent. Let 5° j4;0; =0 where «,€K. Then «; = ya, .a., where 


Py 
a; € F, and so 
> Laisiei=0 
But the a;’s are L-independent and so 
>a; = 0 
j 


for all i. Hence a; = 0 for all i, j. It follows that «; = 0 for all i, whence 
{B;} is K-independent. 

fie6] In the finite (hence algebraic) case, we have remarked that the 
map w:K @ L-KL is surjective and so it is also injective if and only if 
dim K @L = dim KL, which by Corollary 0.8.5 is equivalent to 


(dim K)(dim L) = dim KL 


all dimensions being over F. 
[6<>7] This follows from Corollary 5.5.2. § 


Definition If any of the equivalent conditions hold in Theorem 5.6.1, we 
say that K and L are linearly disjoint over F. 0 
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*5.7 The Krull Topology 


We have seen that if F< E is a finite Galois extension, then all 
subgroups of the Galois group Gp(E) are closed but if F < E is infinite 
and Galois, this need not be true (see Example 5.3.1). The use of the 
term closed suggests the presence of a topology, which we now define. 


Definition Let E™ be the set of all functions from E into E. We define a 
topology J on EE, called the finite topology, by specifying as subbasis 
all sets of the form 


Su,y = {£EE | fu = v} 
where u, v € E. A basis for J thus consists of all sets of the form 
{fE—-E | fu, = v,,..., fu, = v,} 
where u,, v; € BE. 0 
Of course, if F < E, then the Galois group Gp(E) is a subset of EE, 
Theorem 5.7.1 If F <E is algebraic then G,(E) is closed in the finite 


topology. 


Proof. We show that any f € EF that lies in the closure cl( G,(E)) of the 
Galois group is actually in Gp(E). A basic open neighborhood of f has 
the form 


{g € EF | gu, = fu,,...,gu, = fu,} 
and so f € cG,(E)) implies that for any uj,...,u, € E there is a o € 
G,(E) for which ou;=fu; for i=1,...,k. It follows that f is a 


homomorphism. For if u, v € E and a, # € F then there is a o € Gp(E) 
for which 


ou=fu, ov= fy, 
o(au + Bv) = f(au + Bv), o(uv) = f(uv) 
Hence, 


f(au + Bv) = o(au + Bv) = acu + Bov = afu + Biv 
and 


f(uv) = o(uv) = (cu)(ov) = (fu)(fv) 


which shows that f is a homomorphism. Also, fu = 0 implies cu = 0 for 
some o € Gp(E) and so u=0, showing that f is injective. Similarly, f 
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fixes F pointwise. Thus, f is an embedding of E into itself over F. Since 
F < E is algebraic, we deduce that f € Gp(E). § 


Thus, if F<E is a Galois extension the Galois group G)(E) is 
closed in the finite topology on EE. The subspace topology inherited by 
G,(E) is called the Krull topology on Gp(E). It follows that a subset of 
G,(E) is closed in the Krull topology if and only if it is closed in the 
finite topology on EE, 

To avoid any temporary confusion, we refer to a subset of Gp(E) 
that is closed in the Krull topology as k-closed and a subgroup of Gp(E) 
that is closed in the sense of the Galois correspondence as g-closed. 
Similarly, we use the term k-open for open sets in the Krull topology. 

Let us determine the closure H in the Krull topology of a subgroup H 
of Gp(E). If 7 € H then given uj,...,u, € E, there is a o € H for which 
Tu; = ou;, for i=1,...,n. This implies that 7 fixes any element of the 
fixed field F(H). Hence, 7 € H if and only if, given u,,...,u,, € E, there 
is a o EH for which 


r| F(H)(u,,...,u,) ~ 7 | F(H)(u,,..-,u,) 


Since any finite extension of F(H) contained in E has the form 
F(H)(u,,...,u,,), we can say that rH if and only if for any finite 
extension K of F(H) contained in E, there exists a o € H for which 
TIk=7lK 

If F(H) < K is a finite extension and K"° is the normal closure then 
F(H) < K"° is a finite Galois extension. Thus 7 € H if and only if for 
any finite Galois extension K of F(H) contained in E, there exists a o € 
H for which 7 |, =o |. Finally, letting 


Hlx ={olx: 0 € H} 
we can say that 7 € H if and only if for any finite Galois extension K of 


F(H) contained in E, we have r|, €H| x. 
If 7 € H and K = F(H), we have 


and so rE G FH) (E)s the g-closure of H, whence 
HC Gyyy(E) 


To see that the reverse inclusion holds, suppose that r € G F(n)(E) and 
let. F(H) < K be a finite Galois extension contained in E. Since F(H) is 
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contained in K, we have 


F(H) = {a € E| oa = a2 for all o € H} 
= {a € K|oa =a for all o € H} 
= {a€K|ca=a for all co €H| x} 
= FH |x) 


where F(H|,) is the fixed field of H|, with respect to the Galois 
correspondence on the Galois extension F(H) <K. (Note that since 
F(H) < K is a Galois extension, if o € H then o is an automorphism of 
E over F(H), whence its restriction o |, is an automorphism of K over 
F(H). Hence, H| , is contained in the Galois group G,, y)(K).) 

Since F(H) = F(H|,), the extension F(H | ,) ek is finite and 
Galois, implying that H|, is g-closed in the Galois correspondence of 
F(H | x) < K. Hence, 


and so 7 € H. It follows that G F(H)(E) CH. Let us summarize. 


Theorem 5.7.2 Let F < E be a Galois extension and let H be a subgroup 
of the Galois group Gp(E). Then the closure G4;)(E) of H with respect 
to the Galois correspondence on F < E is the closure of H in the Krull 
topology. 0 


Let F < E be a Galois extension. We leave it to the reader to show 
that the composition map 


Gp(E) x Gp(E) Gp(E):(0,7) Hor 
and the inversion map 
Gp(E)HGp(E):o+07} 


are continuous under the Krull topology. Hence, G;(E) is a topological 
group. In fact, it can be shown that Gp(E) is a compact, totally 
disconnected topological group. 

We conclude this section by completing the proof of Theorem 5.5.1 
in the infinite case. Recall that F < E is Galois and F < K. The map 
:G,(EK)—G,(E) is defined by Yo =o |p and we wish to show that 
Im yw is closed with respect to the Galois correspondence on F < E. 
Theorem 5.7.2 implies that this is equivalent to showing that I= Im» 
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is closed in the Krull topology on Gp(E). 

Let 7 €I, the Krull-closure of I. We show that 7 €I by finding aa € 
Gy(EK) for which o |; = 7. Let us define o:EK—EK as follows. Since 
K < EK is algebraic, any element a € EK is a finite sum of the form 


a= ')le;k; 
where e; € E and k; € K. We set 
oa = Fo (rej)k; 
The first order of business is to show that this is well-defined. 
To this end, note that since 7 €I, it follows that for any finite set 
U = {u,,...,u,} CE, there exists a oy € Gy(EK) that agrees with 7 on 
the elements of U, that is, for which 
ayu; = Tu;, for alli 
Hence, if U = {e,,...,e,} then 
oa = Yi (reiki = Li (oyeikj 
Now suppose that a can also be written as 
a= \ ek; 
Let V = {ef} C E and let oy yy agree with 7 on UUV. Then 
LU (reky = L (euuvedki = uv v( > ek;) 
= oyuv( Deki) = Ceyuvepki = Xi (rekt 


Thus, the definition of oa does not depend on the representation of a, 
and o is well-defined. 
Now suppose that 


a= Vek; a= dD eaik; yeoey AQ = = Lenk; i 


is any finite set of elements of EK and let U = {e;;}. If o’ € Gx(EK) 
agrees with r on the elements of U, then 


o'o, = 0” Del = = % 7(e;)k, = Pe = 00; 
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for all j = 1,...,n. In other words, for any finite subset S of EK, there is 
an element of G,(EK) that agrees with o on S. 

It follows that o is a homomorphism of EK, for if a, 6 € EK then 
there exists a o’ € Gy(EK) that agrees with o on a, 8, a+ and af 
and since o’ is a homomorphism, we have 


o(a+ B) =o'(a+ B) =0'a+o0'B =cat+oB 


o(af) = o'(af) = (0'a)(0'8) = (a)(0f) 


and 


It also follows that o is injective, for if oa =0 then there is a o' € 
G,(EK) such that o’a = 0, whence a = 0. The surjectivity of o follows 
from that of r, since if a € EK, then 


a= Dek; = \(ref)k; = o Xefk) 


Finally, it is clear from the definition that oa = a for all a € K and 
that ca=rTa for all a€E. Thus, o € Gy(EK) and o|,=7. This 
completes the proof of Theorem 5.5.1. 


Exercises 

1. If A:2LAb is an order reversing bijection between two lattices, 
verify that \(a A b) = Aa V Ab and X(aV b) = AaA Ab. 

2. With respect to a Galois connection, if P is a complete lattice then 
CP) is also a complete lattice. 

3. If K<E and L<E are Galois extensions, show that KNL <E is 
a Galois extension. 

4. Let K and L be subfields of a field E and suppose that K < E and 
L < E are Galois, with Galois groups G, and Go, respectively. Let 
GG, be the join of G, and G, in the lattice § of all subgroups of 
Gx mp (E) and let G, V Gy be the join of G, and G, in the lattice 
G of all closed subgroups of Gy ,1(E). Show that G,G, is finite if 
and only if Gx ,,(E) is finite, in which case G,G, = G, VG,. 

5. Let F<E be finite with G = G,(E). Let G, 4G, <G, with F; = 
F(G;). Show that Gp (F,) ~ G,/G,. 

6. Find an example of “an infinite algebraic extension whose Galois 
group is finite. 

7. Prove Corollary 5.5.2. 

8. Let F be a perfect field. Suppose that there is a prime p for which 
p|{E:F] for every proper finite extension E of F. Show that if E is 
a finite extension of F then [E:F] = p” for some n EN. Apply this 
to the case F = R to deduce that if R < E is a finite extension the 
[E:R] = 2” for some n EN. 
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10. 


11. 


12, 


13. 


14. 


15. 


16. 


17. 


18. 


Let F < E be a finite Galois extension and let E < K. Then [EK:K] 

divides [E:F]. Use the following to show that the assumption that 

F <E be Galois is essential. Let a be the real cube root of 2, let 

w #1 be a cube root of 1. Let F = Q, E = Q(aw) and K = Q(a). 

Prove the following statements about abelian and_ cyclic 

extensions. 

a) If F <E and F < K are abelian, then F < EK is abelian. 

b) If F <E is abelian (cyclic) and F < K, then K < EK is abelian 
(cyclic). 

c) If F<K<E with F<E abelian (cyclic), then K<E and 
F < K are abelian (cyclic). 

Let F < E and F < K be extensions, with E and K contained in a 

larger field. Show that E and K are linearly disjoint over F if and 

only if E’ and K’ are linearly disjoint over F for all intermediate 

fields F < E’ < E and F < K’ < K with [E’:F] and [K’:F] finite. 

Let F < E be a normal extension. Show that the separable closure 

F°° of F in E and the purely inseparable closure F'° of F in E are 

linearly disjoint over F. Moreover, if F < K < E and if K and F"* 

are linearly disjoint over F then F < E is separable. 

Let f(x) € F[x] and let F < E. Let Sp be the splitting field of f(x) 

over E. Thus, if a,,...,@, are the roots of f(x) in Sp, we have 

Sp = E(a,,...,0,). Let Sp=F(a,,...,¢,) and let L= 

Sp nN F( Gz(Sp))- Let v:Gp(Sp)— G, (Sp) be defined by yo =o | So’ 

Show that 7 is an isomorphism. This is known as the Theorem on 

Natural Irrationalities. 

Referring to Theorem 5.5.3, show that if F is an arbitrary family 

then the map y is an isomorphism if 


Bn( iY E)=F for all j EI 


Extend the notion of closure obtained from the Galois extension to 
all subsets of Gp(E), and show that it is a closure operation in the 
sense of topology. 

Prove that Gp(E) is a topological group under the Krull topology. 

Show that this topological group is totally disconnected. 

Let F<E and suppose that S is a finite set of elements 

algebraically independent over E. Then F(S) and E are linearly 

disjoint over F. 

a) Show that in every Galois extension F < E, there is a largest 
abelian subextension F#, that is, F< F#><E, F< F® is 
abelian and if F < K < E with F < K abelian then K < F®. 

b) If G is a group, the subgroup G’ generated by all 
commutators ofa 'p-', for a, BEG, is called the 
commutator subgroup. Show that G' is the smallest subgroup 
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19. 


20. 


5 Galois Theory I 


of G for which G/G’ is abelian. 

c) If the commutator subgroup Gp(E)! of a Galois group G(E) 
is closed, that is, if Gp(E)’ = Gy(E) for some F < K < E, then 
K=F*. 

Let F < K and let F< E< L. Assume that K and L are contained 

in a larger field. Then K and L are linearly disjoint over F if and 

only if K and E are linearly disjoint over F and KE and L are 

linearly disjoint over E. 

The following concept is analogous to, but weaker than, that of 

linear disjointness. Let F <K and F <L be extensions, with K 

and L contained in a larger field. We say that K is free from L 

over F if whenever SCK is a finite set of algebraically 

independent elements over F, then S is also algebraically 

independent over L. 

a) The definition given above is not symmetric, but the concept 
is. Show that if K is free from L over F, then [KL:L], = [K:F],. 
Let T be a finite F-algebraically independent set of elements 
of L. Show that T is algebraically independent over K. 

b) Let F <K and F <E be field extensions, contained in a larger 
field. Prove that if K and L are linearly disjoint over F, then 
they are also free over F. 

c) Find an example showing that the converse of part b) does 
not hold. 


Chapter 6 
Galois Theory II 


In this chapter, we pass from the highly theoretical material of the 
previous chapter to the somewhat more concrete, where we consider the 
Galois groups of the splitting fields of specific types of polynomials. 


6.1 The Galois Group of a Polynomial 


The Galois group of a polynomial p(x) € F[x] is defined to be the 
Galois group of a splitting field S for p(x) over F. This group is 
sometimes denoted by Gp(p(x)). If 


p(x) = py!(x): + pyk(x) 


is a factorization of p(x) into powers of distinct irreducible polynomials 
over F, then S is also a splitting field for the polynomial q(x) = 
P;(x)-+-p,(x). Moreover, the extension F <S is separable (and hence 
Galois) if and only if each p;(x) is a separable polynomial. In particular, 
if p(x) has no multiple roots, then F < S is a Galois extension. 

Note that each o € G,(S) is uniquely determined by its action on the 
roots of p(x), which generate S, and that this action is a permutation of 
‘the roots. However, not all permutations of the roots of p(x) need 
correspond to an element of G,(S). Thus, we have an injective group 
homomorphism from Gy(S) into the symmetric group S,, where n = 
deg p(x). 

Let p(x) = f(x)g(x) where deg f(x) >0 and let S, be the splitting 
field for p(x) over F and Sy the splitting field for f(x) over F. We clearly 
have F<S;< Sp with each step normal. Hence, by Theorem 5.4.1, 
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Gs,(S,) 4 Gp(S,) and 
Gp (S,) 


Gp(S-) = Gs,(S,) 


or, in another notation, 


Gp(f(x)) ~ pa 
f 


Thus, the Galois group of a nontrivial factor of p(x) is isomorphic to a 
quotient group of the Galois group of p(x). 


6.2 Symmetric Polynomials 


If F is a field and t,,...,t,, are algebraically independent over F, the 
polynomial 


g(x) = Ie-¥) 


is referred to as a generic polynomial over F of degree n. Since the roots 
t,,...,t, of the generic polynomial g(x) are algebraically independent, 
this polynomial is, in some sense, the most general polynomial possible. 
Accordingly, it should (and does) have the most general Galois group, 
as we will see. 

The generic polynomial can be written in the form 


g(x) =x" 5x97} $+ + (-1)Psy 


where the coefficients s, € F(t,,...,t,) are given by 


n 
8, =tyt--+t,, = )otity,...,s,= [[t, 
i<j i=l 
and are called the elementary symmetric polynomials in the variables t,. 
It follows that the coefficients of any polynomial are the elementary 
symmetric functions of the roots (in a splitting field) of that 
polynomial. 

Since F(t,,...,t,) is the splitting field for g(x) over F(s,,...,8,), and 
since g(x) has no multiple roots, we deduce from the remarks of the 
previous section that the extension F(s,,...,8,) < F(t,,...,t,) is Galois 
of degree at most n!. Moreover, any permutation o €S, of {1,...,n} 
induces a unique automorphism of F(t,,...,t,,) defined by 
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Z P(ty-ssoba) Pltg(z)r++-rt ony) 
q(t,,...,t,) A(to(ryr+++> tony) 
Let us denote the group of all such automorphisms by G. 
According to Theorem 5.3.5, since G is a finite group of 


automorphisms of F(t,,...,t,), the extension F(G) < F(t,,...,t,) is 
finite and Galois, with Galois group G and so 


[F(t,,...,t,):F(G)] = |G| = |S,| =n! 


Since every elementary symmetric function is fixed by the elements of 
G (hence the name symmetric function), we have 


F(s,,...,8,) < F(G) < F(t,,...,t,) 
and since 


[F(t,,...,t,):F(s,,...,8,)] <n! 
we have equality above and F(G) = F(s,,...,8,): 


Theorem 6.2.1 Let t,,...,t,, be algebraically independent over F and let 


S,,---,8, be the elementary symmetric functions in t,,...,t,,. 
1) F(s,,...,8,) < F(t,,...,t,) is a Galois extension of degree n!, 
whose Galois group is isomorphic to the symmetric group S,.. 

2) The generic polynomial g(x) is irreducible over F[s,,...,s,]. 


Proof. To prove part 2), observe that if g(x) =a(x)b(x) where 

deg a(x) =d >0 and deg b(x) =e>0, then the Galois group of g(x) 

would have size at most d!e! < (d +e)! =n!. Hence g(x) is irreducible. & 

Definition A polynomial p(t,,...,t,,) € F[t,,...,t,] is symmetric if 
P(tgc1s oe tony) = P(t, eeey t.,) 

for all permutations o € S,. Equivalently, p is symmetric if 

o[p(t,,...,t,)] = p(t,,...,t,) 
for allo €G.0 


Thus, a polynomial p(t,,...,t,) € F(t,,...,t,) is symmetric if and 
only if it lies in the fixed field F(s,,...,8,), that is, if and only if it is a 
rational function in s,,...,8,. However, we can improve considerably 
upon this statement. 
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Theorem 6.2.2 Let t,,...,t,, be algebraically independent over F and let 
$1,---,8, be the elementary symmetric functions in t,,...,t,. A 
polynomial p(t,,...,t,) € F[t,,...,t,] is symmetric if and only if there 
exists a polynomial q(x,,...,X,) over F for which p(t,,...,t,) = 
q(s},.--,8,). Moreover, if p(t ,...,t,) has integer coefficients, then 
q(x,,.-.,X,,) can be chosen with integer coefficients. 


Proof. If p(t,,...,t,) has the form q(s,,...,s,), then it is clearly | 
symmetric. For the converse, the proof consists of a procedure that can 
be used to construct the polynomial q(x,,...,x,). Unfortunately, while 
the procedure is quite straightforward, it is recursive in nature and not 
at all practical. 

We use induction on n. The theorem is true for n = 1, since s, = t,. 
Assume the theorem is true for any number of variables less than n and 
let p(t,,...,t,,) be symmetric. By collecting powers of t,,, we can write 


p(ty,..-,t,) = Po t+ Pity + Pot? +--+ +p, th 


where each p; is a polynomial in t,,...,t,_;. Since p is symmetric in 
t,,.-.,t,_, and t,,...,t,, are independent, each of the coefficients p, is 
symmetric in t,,...,t,_,- By the inductive hypothesis, we may express 
each p; as a polynomial in the elementary symmetric functions on 
t,,---,t,_,- If these functions are denoted by u,,...,u,_;, then we have 


(6.2.1) P(ty,..-yt,) = 9 +4, t, + aot? +--+ +q,t2 


where each q; is a polynomial in u,,...,U,_4, with integer coefficients if 
p has integer coefficients. 

Note that the symmetric functions s; can be expressed in terms of the 
symmetric functions u; as follows 


8; =u, +t, 
So =Ugt+ u,t, 
(6.2.2) : 
Sy—1 = Un-1 tUp_aty 
8, = Unity 
These expressions can be solved for the u,’s in terms of the s,’s, giving 
u, =s,—-t, 


a, pom 2 
Ug = Sq —u,t, — Sy — St, +t, 
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Ug = 83 — Ut, = 83 — Sot, + syti—t) 


Un—1 = Sn—1 ~ Un—2tn = Spa — Spat, Ho + (-1) A 
and from the last equation in (6.2.2), 
(6.2.3) 0 =s, —u,_it, = 8, —Sp_yty $o°° + (-1)7t2 
Substituting these expressions for the u,’s into (6.2.1) gives 
P(ty,---st,) =To HMyt, ryt? +--+ +4102 
where each r,; is a polynomial in s,,...,s,_; and t,, with integer 


coefficients if p has integer coefficients. Again, we may gather together 
powers of t,, to get. 


P(ty,-.-,t,) _ Bot Stn + Bote + cats +¢.tn 
where each g; is a polynomial in s,,...,8,_,, with integer coefficients if 


p has integer coefficients. If m >n, we may reduce the degree in t,, by 
using (6.2.3), which also introduces the term s,. Hence, 


(6.2.4) P(t,,---,t,) = ho +hyt, + hot? +---+h,_ th 
where each h, is a polynomial in s),...,8,, with integer coefficients if p 
has integer coefficients. 
Since the left side of (6.2.4) is symmetric in the t;’s, we may replace 
t,, by t;, for each i= 1,...,n—1, to get 
p(t,,---yt,) =hp +hyt; + hot? +---+h,_ te} 
valid for all i= 1,...,n. Hence, the polynomial 


P(x) = hg + hyx + hx? + +++ + hy"! — p(tyy--+sta) 


has degree (in x) at most n—1 but has n distinct roots t,,...,t,, 
whence it must be the zero polynomial. Thus, h;=0 for i>1 and 


p(t,,.-.,t,) = hg = ho(s),...,8,), as desired. & 


Example 6.2.1 Let. p(x) =x"—s,x"-1+4---+(-1)"s, be a polynomial 
with roots r,,...,1, in a splitting field. For k > 1, the polynomials 


Net en sear 
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are symmetric in the roots of p(x), and so Theorem 6.2.2 implies that 
they can be expressed as polynomials in the elementary symmetric 
functions s,,...,8, of the roots. One way to derive an expression 
relating the u,’s to the s,’s is by following the proof of Theorem 6.2.2. 
In the exercises, we ask the reader to take another approach to obtain 
the so-called Newton identities 


Ue — Uj 784 + Uye_98q + +++ (=1)* Muy 5_ 1 + (-1)Kks, = 0 


for k > 1. These identities can be used to compute recursively the u,’s 
in terms of the s,’s. 0 


Since any symmetric polynomial in the roots of a given polynomial 
p(x) is a polynomial in the coefficients of p(x) as well, it therefore lies 
in the base field. 


Corollary 6.2.3 Let: p(x) € F[x] have roots r,,...,1,, in a splitting field. If 
f(t,,...,t,) is a symmetric polynomial, then f(r,,...,1,,) is a polynomial 
in the coefficients of p(x), and thus lies in F. 


Proof. We know that f(r,,...,1,) =g(S,,.-.,8,) where s; is the i-th 
elementary symmetric polynomial in the roots r,,...,1,. But s; or —s; is 
the coefficient of x"~* in p(x), whence f is a polynomial in these 
coefficients. &f 


Theorem 6.2.4 The elementary symmetric polynomials s,,...,8 


n are 
algebraically independent over F. 


Proof. Since F(s,,...,8,) < F(t,,...,t,) is algebraic, Theorem 3.3.1 
implies that S = {s,,...,8,} contains a transcendence basis for 
F(t,,...,t,) over F. But {t,,...,t,} is a transcendence basis and so 
[F(t,,...,t,,):F], =m. Hence, S is a transcendence basis. § 


6.3 The Discriminant of a Polynomial 


We have seen that the Galois group Gp(p(x)) of a polynomial of 
degree n is isomorphic to a subgroup of the symmetric group S, and 
that the Galois group of a generic polynomial is isomorphic to S,, itself. 
A special symmetric function of the roots of p(x), known as the 
discriminant, provides a useful tool for determining whether or not the 
Galois group is isomorphic to a subgroup of the alternating group. 

Let p(x) be a polynomial over F, with roots r,,...,r,, in a splitting 
field E. Let 
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6= II (r; = T;) 
1<)j 
The discriminant of p(x) is A = 6”. Note that A #0 if and only if p(x) 
has no multiple roots. 
Let us assume that A#0. Hence p(x) is the product of distinct 
separable polynomials, implying that F < E is a Galois extension. Each 
o € G,(p(x)) acts as a permutation of the roots r; and so 


a6 = (-1)76 


where (-1)% is 1 if o is an even permutation and -1 if o is an odd 
permutation. Hence, cA = A, implying that A € F. If char(F) = 2, then 
o6 = 6 for all o € Gp(p(x)) and so 6 EF. 

If char(F) #2, we have two possibilities. If 6€F then all o € 
Gy(p(x)) fix 6 and are therefore even. Hence Gp(p(x)) is isomorphic to 
a subgroup of the alternating group A,. If 6¢ F then Gp(p(x)) must 
contain an odd permutation. It is not hard to show that if a subgroup 
of S,, contains an odd permutation then the subgroup has even order 
and exactly half of its elements are even. Hence, if 6¢ F then Gp(p(x)) 
has even order and 


| Ge(P(0)) Ny | = 3] Gp(P(%)) 


If we let H = Gp(p(x)) NA, then F(H) < E is Galois, with Galois group 
H and so 


[E:F(H)] = |H| = 31 Gp(p()) | = Z[E:F] 


which implies that [F(H):F] = 2. But [F(6):F] =2 and F(6) C F(H), 
whence F(H) = F(6). In words, the fixed field of the even permutations 
in Gp(p(x)) is F(6). Let us summarize. 


Theorem 6.3.1 Let p(x) € F[x] have splitting field E. 


1) A=0 if and only if p(x) has multiple roots in E. 
2) Assume that A # 0 and char(F) F 2. 

a) If A has a square root in F, then the Galois group Gp(p(x)) is 
isomorphic to a subgroup of the alternating group A,. 

b) If A does not have a square root in F, then the Galois group 
Gy(p(x)) contains half odd and half even permutations of the 
roots of p(x). In addition, the fixed field of Gp(p(x))NA,, is 
F(VA). 

3) Assume that A # 0 and char(F) = 2. Then A has a square root in 

F, but Gp(p(x) need not be isomorphic to a subgroup of A,. 
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Proof. For part 3), observe that the generic polynomial 
g(x) = (x ty) -(«-t,) 
has Galois group S,, over F(s,,...,8,). 
The usefulness of Theorem 6.3.1 comes from the fact that A can 


actually be computed in some cases. To see why this is so, observe that 
6 is the Vandermonde determinant 


Tt rT. I. 
1 2 n 
6 = . . 
n-1 ,n-1 n—-1 
ty T ae Th 


Up Uy Un-i 
is uy Ke = 


Uj-1 Un ** Uon-2 
where u; =r trite tri, Newton’s identities can then be used to 
determine the u,’s in terms of the coefficients of the polynomial in 


question (see Example 6.2.1 and the exercises). We will see some 
examples of this in the next section. 


6.4 The Galois Groups of Some Small Degree 
Polynomials 


Quadratic Polynomials 


Quadratic extensions (extensions of degree 2) hold no surprises except 
perhaps for certain base fields of characteristic 2. Let 


p(x) = x? + bx +c = (x—r)(x—s) 


be a quadratic over F, with splitting field E. To compute the 
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discriminant, observe that u, =r-+s = b and 


uy =r? +s? = (r +s)? — 2rs = b? — 2c 


Hence 


2 »b 
A= = 2(b? — 2c) —b? = b? — 4c 
b b*-2c 


a familiar quantity. 
If A = 0 then p(x) has a double root r and 


p(x) = (x —r)* = x? — 2rx +1? 


The root r will lie in F for most well-behaved base fields F. In 
particular, if char(F) #2, then -2re¢F implies réF. If char(F) =2 
and F is perfect (a finite field, for example) then r € F. However, the 
following example shows that p(x) may have a multiple root not lying 
in F. Let F = Z,(t?) where t is transcendental over Z, and let 


p(x) =x? 1? = (x-t)? 


Since t ¢ Z,(t?), this polynomial is irreducible over Z,(t”), but has a 
multiple root t ¢ F. 

If A #0 then p(x) has distinct roots and there are two possibilities: 
(i) the roots lie in F, p(x) is reducible and Gp(p(x)) is trivial, or (ii) the 
roots do not lie in F, p(x) is irreducible and Gp(p(x)) ~ Z, is generated 
by the map o:r—s. When char(F) # 2, we can tell whether or not the 
roots lie in F by looking at the discriminant, since the quadratic 


formula gives 
-b+Vb?—4c_ -b+ V/A 
2 a 2 


1,s= 
Hence the roots lie in F if and only if A has a square root in F. 


Theorem 6.4.1 Let p(x) € F[x] have degree 2. 


1) If A=0 then p(x) =(x—r)? has a double root r, which may or 
may not lie in F. In any case, Gp(p(x)) is trivial. 

2) If A#0 then p(x) has distinct roots and there are two 
possibilities: (i) the roots lie in F, p(x) is reducible and Gp(p(x)) is 
trivial, or (ii) the roots do not lie in F, p(x) is irreducible and 
Gp(p(x)) ~ Z, is generated by the map o:r—s. 

3) If char(F) #2 then all quadratic extensions F < E have the form 
E = F(,/a), for some a € F. 
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Proof. Part 3) follows from the fact_that if A is the discriminant of 
min(a,F) then part 2) implies that WAN ¢ F, whence E = F(V/A). | 


Let us turn now to a more interesting case. 


Cubic Polynomials 


Let 
p(x) = x? + bx? +cx +d = (x —1)(x —s)(x —t) € F[x] 


have splitting field E. Then p(x) is irreducible if and only if none of its 
roots lie in F. Let us assume that p(x) is irreducible. A straightforward 
but lengthy computation gives 


A = -4b°d + bc? + 18bed — 4c? — 27d? 


Assume first that A = 0. Then p(x) has multiple roots and Corollary 
1.6.4 implies that p(x) = q(x"), where p=expchar(F) and p*>1. 
Since deg p(x) = 3, we must have p = 3, k = 1 and so 

p(x) = (x-1r)? =x? - 1 
has a single root of multiplicity 3. The extension F < F(r) = E is purely 
inseparable of degree 3 and the Galois group is trivial. 

If A #0 then p(x) has no multiple roots and is therefore separable. 
Hence, F < E is Galois and | Gp(p(x))| = [E:F]. Since r¢ F, we have 
[E:F] > 1, which leaves the possibilities [E:F] = 3 or 6. If p(x) splits in 
F(r), then [E:F] =3 and the Galois group is isomorphic to Aj ~ Z3. If 
p(x) does not split in F(r), then [E:F] = 6, in which case the Galois 
group is isomorphic to S,. When char(F) # 2, these two cases can be 
distinguished by examining the discriminant. If Va EF, then 
Gp(p(x)) = Ag and if /A ¢ F then Gp(p(x)) ~ S3- 


Theorem 6.4.2 Let p(x) € F[x] be irreducible of degree 3. 


1) If A=0 then p(x) has a single root of multiplicity 3 and 
char(F) = 3. The Galois group is trivial. 

2) If A#0 then Gp(p(x)) ~ Ag or S3. 

2) Let char(F) £2. If0F JA € F then Gp(p(x)) ~ Ag and adjoining 
a single root of p(x) to F gives the splitting field for p(x). If 
VA ¢ F then Gp(p(x)) ~ S3. 0 


Example 6.4.1 Let p(x) = x? — 2x?-—x+1 over Q. Any rational root of 
p(x) must be +1 (Theorem 1.2.2) and so p(x) is irreducible. The 
discriminant is A=49 which has a square root in Q and so 
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Go(p(x)) x Ag is cyclic of order 3. On the other hand, the irreducible 
polynomial q(x) = x?—x-+1 has discriminant A = —23, which has no 
square root in Q. Hence, its Galois group is isomorphic to S3. 0 


*Quartic Polynomials 


Since the Galois group of an irreducible quartic polynomial is 
isomorphic to a transitive subgroup of S,, we should begin by 
determining all such subgroups of S,. Theorem 0.2.21 implies that if G 
is a transitive subgroup of S, then |G| = 4, 8, 12 or 24. Here is a list. 


1) The cyclic group Z, occurs as a subgroup of S,, for instance 
((1234)) ~ Z,. 
2) The four group Z, x Z, occurs as a subgroup of S,. In particular 


V = {e, (12)(34), (13)(24), (14)(23)} 


is isomorphic to Z, x Z, and is known as the viergruppe. We leave 
it to the reader to show that V is normal in S,. This and the 
previous case exhaust all nonisomorphic groups of order 4. 

3) The dihedral group D, of symmetries of the square, thought of as 
permutations of the corners of the square, is a subgroup of S, of 
order 8. Since D, is a Sylow 2-subgroup of S,, all subgroups of S, 
of order 8 are isomorphic to D4. 

4) The alternating group A, is the only subgroup of S, of index 2, 
that is, of order 12. 

5) Of course, S, is the only subgroup of S, of order 24. 


Let p(x) = x*+ax>+ bx?+cx+d be an irreducible quartic over F 
and let us assume that char(F) # 2, 3. This will insure that 440 and 
that all irreducible cubic polynomials that we may encounter are 
separable. Replacing x by x—a/4 will eliminate the cubic term, 
resulting in a polynomial of the form 


q(x) =x*+ px? +qx+r 


The polynomials p(x) and q(x) have the same splitting field and hence 
the same Galois group, so let us work with q(x). Let E be the splitting 
field of q(x), let r,,...,14 be its roots in E and let G = G,(E) be its 
Galois group. For convenience, we identify G with its isomorphic image 
in Sy. 


The Quartic x! + bx” +c 


In order to get our feet wet, let us first consider the special case 
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q(x) =x?+bx? +c 
If we denote the roots of q(x) in E by ta, + then E = F(a,f) and 
b=-(a? + 6”), c= ap? 
We define the associated quadratic to q(x) to be 
G(x) =x? + bx+c 
The roots a” and (3? of q(x) are given by 


2 22 -b+ Vb? —4c 
i ae Po 


The irreducibility of q(x) can be determined as follows. Certainly if 
G(x) is reducible over F, then so is q(x). On the other hand, if q(x) is 
irreducible then its roots a? and §? do not lie in F, whence q(x) cannot 
have a linear factor over F and, if reducible, must have the form 


q(x) = x4 + bx? +c = (x? + ux + v)(x? — ux + w) 
where, as seen by equating coefficients, u(v — w) = 0. However, if u=0 
then 
q(x) = (x? + v)(x? + w) 
which gives 
(x) = (x + v)(x + w) 


contradicting the irreducibility of q(x). Thus, u #0 and. v = w. We can 
summarize as follows: 


1) If Vb?—4c € F then (x), and therefore q(x), is reducible. 
2) If Vb?—4c¢F then q(x) is reducible if and only if it has the 


form 
q(x) = x4 + bx? +.¢ = (x? + ux + v)(x? — ux + v) 


2 


where v? =c and 2v—u* =b. 


For example, let q(x) = x4 + 6x? +4 over Q. Then b?— 4c = 20 and 
Vv 20 ¢ Q. From 2) above we have 


v=4v= +2 
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and 
uw =2v-6= +4-6 = -2, -10 


and since the latter has no solutions u€éQ, we see that q(x) is 
irreducible over Q. 

Let us now assume that q(x) is irreducible over F and has distinct 
roots. We have seen that [E:F] = 4, 8, 12 or 24. However, not all 
permutations of the roots are elements of the Galois group G. For 
instance, if o € G sends a to #, it must send —a to —{. The possibilities 
for elements of G are listed below, where we give the action on a and , 
as well as a description as a product of transpositions, assuming that 
the roots are taken in the order a, 3, -a, —(. 


1) epee, 698 (1) 
2) og:a—-a, B-B = (24) 

3)  o3:a-a, BB (18) 

4) og:a-a, B-B  (13)(24) 

5) Os:a—8, Ba (12)(34) 

6) og:iaB, B>-a = (14)(13)(12) 
7) o7ia—--B, Boa  — (12)(13)(14) 
8) og:a—-f8, B-a (14)(23) 


Note that all nonidentity maps have order 2 except ag and ay. In fact, 
{o,,...,0g} is isomorphic to the dihedral group D,, with rotation o¢ 
(order 4) and reflection og (order 2). Thus G is (isomorphic to) a 
subgroup of D, and so [E:F] = 4 or 8. In the latter case G ~ Dy. In the 
former case, G ~ Z, or G2 V. 

The square root of the discriminant of q(x) is 


6 =(a—f)(a+a)(a+ B)(8 +a)(B + 8)(-« + 8) = -4a8(a? — 6)? 
and since (a? — 6*)* is invariant under each o;, it must lie in the base 
field F. Hence, 6 € F if and only if af € F, or equivalently, Ve EF. It 
follows from Theorem 6.3.1 that 


1) If Jc €F then G is isomorphic to a subgroup of A,. Thus, it 
contains only even permutations and so 


G = {0},04,05,03} = V 


2) If ,/e¢F then G contains half even and half odd permutations 
and GN Ag is F(5) = F(./c). 
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Under case 2), we still have the possibilities |G| =4 or |G| =8. 
In the former case, GMA, must consist of o, and one of the even 
permutations o4, o, or og. The other two elements of G must come 
from the odd permutations 74, 03, og and oy. If G has no element of 
order 4, then we can eliminate og and oy. But it is easy to check that 
the set {o,,0;,05,03}, where i= 4, 5 or 8, is not transitive on the roots 
of q(x). Hence, G must contain an element of order 4 and 


G={o,,0¢/08 =o, 03 = 07} SZ, 

To identify the case G ~ Z, directly from the coefficients of q(x), 
observe that in this case [E:F(,/c)] = 2 and so q(x) has an irreducible 
quadratic factor over F(,/c). Thus, 

q(x) = x4 + bx? +c = (x? + ux + v)(x? — ux + w) 
where r(x) = x? + ux +v is irreducible over F(,/c). Since 
Cr fy) = {o,, os} 


it follows that o, must send one root of r(x) to the other root and so 
the roots of r(x) are +a or +f. In either case, u = 0 and so 


(6.4.1) q(x) = x4 + bx? +c = (x? + v)(x? + w) 
which implies that q(x) is reducible over F(,/c), that is, 


Vb? — 4c € F(,/c) 


\/c(b? — 4c) € F 


Conversely, if this holds, then G(x) is reducible over F(,/c) and 
therefore q(x) has the form (6.4.1), where v, w € F(,/c). Since vw =c, 
the polynomial q(x) splits over F(/c,/V), whence E = F(/c,/¥). 
Thus, [E:F(,/c)] = 2 and [E:F] = 4. Let us summarize. 


or, equivalently, 


Theorem 6.4.3 Let q(x) = x4 +bx? +c be irreducible with distinct roots 
over F. Let G be the Galois group of q(x). Let V be the viergruppe. 


1) If /ceF then G=V. 


2) If /e¢F and y/c(b?— 4c) € F then G ~ Z, and G(x) is reducible 
over F(4/c). 
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3) If Vce¢F and y/c(b?—4c)¢F then G~xD, and x) is 


irreducible over F(,/c). 0 


The General Quartic 


To analyze the general quartic polynomial p(x), we consider which 
elements of the Galois group lie in the viergruppe V. This gives us a 
subgroup VNG of G and hence an intermediate subfield F(VMG) of 
the splitting field E. Since VS, we have VOG 4G. Our guide will be 
Figure 6.4.1. 


E = split(p(x)) G,(p(x)) = G 
possible degrees: 
degree: 1,2 or 4 1 [r(x) splits over F] 


2 [r(x) has one root in F] 
3 or 6 [r(x) irred. over F] 


FVOG) = split(r(x)) VAG 
possible degrees: 
1 [r(x) splits over F] degree: 1,2 or 4 
2 [r(x) has one root in F] 
3 or 6 [r(x) irred. over F] 


F 
Figure 6.4.1 


To determine the fixed field of VMG, note that each element of V 
fixes the elements 


w= (ry +19)(t3 +14) 
v = (ry +13)(tg +14) 
w = (ry +14)(tg +13) 


and so F(u,v,w) < F(VNG). By checking each permutation in S,, it is 
not hard to see that no permutation outside of V fixes u, v and w. 
Thus, 

Crruv,w)(E) < VAG 


Taking fixed fields gives F(VNG)<F(u,v,w) and so F(VNG)= 
F(u,v,w). 

Note also that any element of S, permutes the elements u, v and w 
and so any symmetric function of u, v and w is also a symmetric 
function of r,...,T4. 


Definition The resolvent cubic of q(x) =x*++px?+qx+r is the 
polynomial r(x) = (x — u)(x — v)(x — w). 0 
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To determine the coefficients of r(x), note that since q(x) has no 
cubic term, it follows that r, +ry +13 +1, = 0. Hence, 


(ty +1)? = -(ty +1r9)(t3 +14) = -u 


Thus, r(x) is a polynomial satisfied by -(r, +12)”. The polynomial q(x) 
factors into a product of quadratic polynomials over E, say 


q(x) = (x? + ax + b)(x? — ax +c) 


where the linear coefficients are negatives of each other since q(x) has 
no cubic term. We can always renumber so that the roots of the first 
factor are r, and ry, whence a=-(r,+r,). Multiplying out the 
expression for q(x) and equating coefficients gives 


b+c—a2=p 
ac—ab=q 
be =r 


Solving the first two for b and c and substituting into the third gives 
a° + 2pat + (p? — 4r)a? — q? =0 
and so a? = (ry + #5)" = -u satisfies the polynomial 
s(x) = x? + 2px? + (p? — 4r)x — q? 
Thus u satisfies the polynomial 
t(x) = x3 — 2px? + (p? — 4r)x + q? 

Since we will get the same polynomial by repeating this argument 
using T; +13 or r,; +14 in place of r, +15, we deduce that t(x) is the 
resolvent cubic of q(x). 

Theorem 6.4.4 The resolvent cubic of q(x) = x4 + px? +.qx +r is 
r(x) = x? — 2px? + (p? — 4r)x +.” 
The splitting field of r(x) over F is the fixed field F(VNG). Hence, 
| Gp(r(x)) | = [F(VNG):F] = (G:VNG) 0 


Let us put all of the pieces together. 
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Theorem 6.4.5 Let p(x) = x4+ax>+bx?+cx+d be an_ irreducible 
quartic over a field F, with char(F) # 2, 3. Let q(x) =x4+ px? +qx+4r 
be obtained from p(x) by substituting x—a/4 for x and let r(x) = 
x? — 2px? + (p? — 4r)x +.q? be the resolvent cubic of q(x). Let A, be the 
discriminant of r(x) and let E be the splitting field for p(x) over F. 


1) If r(x) is irreducible over F and ,/A, € F then Gp(p(x)) ~ Ay. 
2) If r(x) is irreducible over F and ,/A, ¢ F then Gp(p(x)) ~ Sy. 
3) If r(x) splits over F then Gp(p(x)) ~ V. 


4) If r(x) has a single root in F there are two possibilities: (i) if p(x) 
is reducible over F(VNMG) it has an irreducible quadratic factor 
and Gp(p(x)) ~ Z4, (ii) if p(x) is irreducible over F(VNG) then 
Gp(p(x)) & Dy. 

Proof. Let G = Gp(p(x)). The situation is described in Figure 6.4.1. We 

begin by observing that VMG < V and so 


|VNG| =1,20r4, (G:VNG) =1, 2, 3 or 6 
and 


| VNG| x(G:VNG) = |G| =4, 8, 12 or 24 


This shows immediately that |VNG|#1. Let us write the 
possibilities as follows 


|VNG| x(G:VNG) = |G| 
(2/4) x (1/2/3/6) = (4/8/12/24) 


Now we can use Theorem 6.4.2. Let 5 be the splitting field of r(x) 
over F. 


1) If r(x) is irreducible and \/A, €F then Gp(r(x)) ~ Ag. Theorem 
6.4.4 gives (G:V MG) = 3, which implies that | VMG| =4 and so 
|G] =12. Thus G ~ Ay. 

2) If r(x) is irreducible and ,/A,¢F then Gp(r(x)) ~S3. Hence 
(G:VNG) =6. If |VNG| =2 then |G| =12 so Gx Ay. But 
VCA, then implies that |VNG|=4. Thus, |VNG| £2, 
leaving the only other possibility: | VMG| = 4. Hence |G| = 24 
and G ~ S,. 

3) If r(x) splits over F then (G:VNG)=[S:F]=1 and so 
| VNG| =4, whence VCG and Gr V. 

4) Suppose that r(x) has a single root in F. Then (G:VNG) = 

- [S:F]=2. There are two possibilities. If |VNG|=2 then 
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|G| =4 and so G~Z, or V. We leave it to the reader to show 
that since G acts transitively, G~V is not possible. Hence, 
G~Z,. Note that, in this case, since E is the splitting field for 
p(x) over S and [E:S]=2 the polynomial p(x) must have an 
irreducible quadratic factor over S. If |VNG| =4 then |G| =8 
and G ~ D,. In this case, p(x) is irreducible over S. § 


Exercises 


1. 


Let p(x) = x" — agxts? +-+++ a, where a,,...,a, are algebraically 
independent over F. Show that p(x) is irreducible over 
F(a,,...,a,), separable and its Galois group is isomorphic to S,,. 
Give an example to show that separability is required in Corollary 
6.2.3. 

If p(x) is a quartic polynomial then its discriminant is the 
negative of the discriminant of its resolvent cubic. Hint: u—v = 
—(ry —14)(t2 — 13): 

Find the Galois groups of the following polynomials: (i) 
x4—10x?41; (ii) xt-4x+4+2; (iii) x4+8x-12 (iv) 
xt+x?4x41 

If p(x) € F[x] has roots r,,...,r, then A = Gee ITp’(x,)- 


Let p(x) € Q[x] have degree 3. Show that A < 0 if and only if p(x) 
has exactly one real root. 
Show that the splitting field for an irreducible cubic polynomial 
over F is given by F(+/A,r), where r is a root of f(x) and A is the 
discriminant. 
Let p(x) = (x —r)(x—s)(x—t), where r, s and t are algebraically 
independent over Z,. Let s,, 8, 83 be the elementary symmetric 
functions on r, s and t. Show that VA € F(s,,89,83) but the Galois 
group of p(x) over F(s,,89,83) is isomorphic to S3. 
Let 

p(x) =x"—s,x"-1 4.--.+4(-1)"s, 


have roots r,...,%, in a splitting field E over F. Let uj= 
tr +r,t+:::+1,. Since the u,’s are symmetric polynomials in the 
roots of p(x), Theorem 6.2.2 implies that they can be expressed as 
symmetric polynomials in the elementary symmetric functions 
$1)---,5,- One way to derive an expression relating the u,’s to the 
s,s is by following the proof of Theorem 6.2.2. Here is another 
way. Let p(x) = (x —1;)q;(x) in E[x]. 


a) Show that Dk+1p(x) = Y Dkq,(x). 
i 
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10. 


11. 


12. 


b) Write 
q(x) = pert) 
and use part a) to derive Newton’s identities: 
Uj — Uj 784 + Uy 98 +++ + (=1)*tuys,_, + (-1)*ks, = 0 


for k = 1, 2, 3,.... 
c) Let p(x) =a+bx+x™. Find the values of u; and find the 
discriminant of p(x). 
Show that the viergruppe V is normal in S,. Find another 
subgroup of S, besides V that is isomorphic to V. 
This exercise concerns the issue of when a real number that is 
expressed in terms of nested radicals 


a= /r+sy/t 


where r, s, t € F can be written in terms of at most two unnested 
radicals. For instance, we have 


dt V2i =i V6 + V4) 


but the number 4/7 + 2,/5 cannot be so written. Note that a is a 


root of the quartic 


a(x) = x4 — rx? + (1? —97t) = fx? - (r+ si]? - (r-sVi)] 


Assume that q(x) is irreducible over F. The question we are 
interested in is whether a € F(./P,./4) for some p and q in F. 
Show that the answer to this question is yes if and only if 
F(,/P,,/4) is the splitting field E for q(x) over F. Then show that 
E= F(./P; 4/4) if and only if the Galois group G of q(x) over F is 
the viergruppe V. Hence, a € F(,/p,,/@) if and only if 


Vr2—s*t EF 


Find a way to compute the unnested expression for a in terms of 
p and ,/q. 

Let p(x) = x4 + bx? + cx? + dx +1 € Q[x] have Galois group G. 

(i) If u=c? +4c +44 — 4b? has a square root in Q then G ~ V. 

(ii) If u does not have a square root in Q but u(b?— 4c +8) does 
have a square root in Q then G ~ Z,. 

(iii) If neither u nor u(b*—4c+8) has a square root in Q then 
G ~ Dy. 


Chapter 7 
A Field Extension as a Vector Space 


In this chapter, we take a closer look at a finite field extension F < E 
from the point of view that E is a vector space over F. It is clear, for 
instance, that any o € Gp(E) is a linear operator on E over F. However, 
there are many linear operators that are not field automorphisms. One 
of the most important is multiplication by a fixed element of E, which 
we study next. 


7.1 The Norm and the Trace 


Let F <E be finite and let a€ E. The multiplication map @:E-E 
defined by @@ = af is an F-linear operator from E to E, since 


a(uf + vy) = ua@B + vay 


for all u, vE F and 6, y€ E. We wish to find a basis for E over F 
under which the matrix of & has a nice form. 

Note that if r(x) € F[x], then r(@)$ =r(a)@ for all @€E and so 
t(~) = 0 as an element of E if and only if r(@) is the zero operator on E. 
Hence, the set of polynomials over F satisfied by @ is precisely the same 
as the set of polynomials satisfied by a. In particular, they have the 
same minimal polynomial over F. 

The vector subspace F(a) of E is invariant under @, since @(p(a)) = 
ap(a) € F(a). If B= {G,,...,8g} is an ordered basis for F(a) over F 
and if 


d 
a8; = 2 bis 
j= 
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then the matrix of @| p(,) with respect to B is M = (b; ;). If {71)---s Ye} 
is a basis for E over F(a) where e = [E:F(a)], then the set of products 


C= {Bas Wy Bay eos MB gre sere++s Ves VeBare++s VePat 


is a basis for E over F. Since 
d 
a(%4f;) = obi jn 
j=1 


it follows that each of the subspaces Vy = (74) 1489)+++) MPa) is 
invariant under @ and the matrix of @|y_ is also equal to M. Hence, 
the matrix of @ with respect to the ordered basis © has the block 
diagonal form 


0 
(7.1.1) No(@) = : 


M 0 
0M 
00° 
000 


Zoco 


Thus, if the characteristic polynomial of @ le i) is q(x), then the 
characteristic polynomial of @ (on E) is 
da(X) _ q(x) EF 
But q(x) € F[x] has degree [F(a):F] = deg min(a,F), is monic and is also 
satisfied by a, whence q(x) = min(a,F). 


Theorem 7.1.1 Let F <E be finite and let a€ E. If @:E-E is the F- 
linear operator on E defined by @8=af then the characteristic 
polynomial of @ is 


da(x) = [min(o, Fy] EFC) : 


We recall from linear algebra that if r:V—V is a linear operator on a 
finite dimensional vector space V over F, the trace of r is the sum of 
the eigenvalues of r and the norm (determinant) of 7 is the product of 
the eigenvalues of 7, in both cases counting multiplicities. Recall also 
that the trace and the norm both lie in the base field F. We are 
motivated to make the following definition. 


Definition Let F < E be finite and let a € E. The trace of a over F < E, 
denoted by Trp pr(@)s is the trace of the F-linear operator @:E—-E and 
the norm of a over F < E, denoted by Np pel), is the norm of @:E>E.0 
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Note that the trace and norm of a depend on the extension field E, and 
not just on the element aq itself. 

Since the trace of a linear operator is the sum of the roots of its 
characteristic polynomial and the norm is the product of these roots, 
Theorem 7.1.1 allows us to express the trace and norm in terms of the 
roots of the minimal polynomial. Let F < E be finite, let a € E and let 


p(x) = min(a,F) = xd + aggre ese 


have roots r,,...,rg in a splitting field. It follows from Theorem 7.1.1 
that 


d 
Trg/p(@) = [E:F(a)] i = -[E:F(a)Jaj_, 


and : 
Nero) = [nero = [(-1)4a,} EF 
i=1 


We remark that many authors simply define the trace and norm of a 
directly from these formulas. 

Alternate expressions for the trace and the norm can be obtained as 
follows. Let r,,...,1, be the distinct roots of p(x). Each of these roots 
appears with multiplicity [F(a):F],; (Theorem 4.6.1) and so 


Trg p(@) os (EF(oEPC@):F Ys = (EF(O)) EFI Son 
and 


s s 
Ngyp(a) soe 1 a a 1} Cee 
i=1 i=1 

Now let us take a look at the trace and the norm from the 
perspective of embeddings of E into an algebraic closure. Let F < E be 
finite and let Hom,(E,F) = {o1,...,0,} be the set of all embeddings of 
E into F over F. If a€ E and p(x) = min(a,F), then o,a,...,0,0 is a 
list of the roots of p(x) in F. However, each root may appear more than 
once in this list. 

To see how many times each root appears, consider the tower 
F < F(a) < E. Each embedding o; is obtained by extending to E an F- 
embedding 7 of F(a) into F, and this can be done in [E:F(q)], different 
ways. Each extension of r has the same value on a and each embedding 
t of F(a) into F has a different value on a. Hence, the list 04 Q,...,0,0 
contains exactly [E:F(a)], copies of each root of p(x). Thus, if r,,...,r, 
are the distinct roots of p(x) in F, then 
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Soa = [E:F(a)], Sr, 

i=1 i=1 
and 

n s Fla 

I oa = If" Ne 


These formulas give another expression for the norm and the trace. Let 
us summarize. 


Theorem 7.1.2 Let F<E be finite and let a€E with p(x) = 
min(a,F) = x? + agaaxe +++ ap. 


1) If p(x) has roots r,,...,rg then 
d 
Trp/p(@) = [E:F(a)] DoH = -[E:F(a)Jag_, 
and Aj - 
Neye(a) = [[rPO! = ((-1)4ag EFC 
i=1 
2) If p(x) has distinct roots r,,...,r, then 


Trgp(@) = (B(@) EF, 


and 
5) [E:F(a)]_[E:F]. 
Ngyr() 2 I (a)],[ Fi 
i=1 
3) If Homp(E,F) = {o,,...,0,,} then 
n 
Trg p(@) = [E:F]; )> ojo 
i=l 


and 


i=1 


Theorem 7.1.2 can be used to derive some basic properties of the 
trace and the norm. We leave proof of the following to the reader. 
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Theorem 7.1.3 Let F < E be finite. 


1) The trace is an F-linear functional on E, that is, for all a, BEE 
and a, be F, 


Trp jp(aatbf) =aTrp jp(a)+b Trp jr(P) 


2) Foralla, @€EandaeF 


Ng/p(@8) = Ng/p(@)Ng/p(8) and Ngyp(ac) = all NE g(a) 
3) IfaeéF then 
Trp/p(@) = [E:Fla and Ngjp(a) = al®*l 
3) IfF <E<L are finite and if a €L then 
Try jp(@) = Treyp(Tr ppl), Nyypl@) = Neyp(Npje(2)) 0 


*7,2 The Discriminant of Field Elements 


Our goal in this section is to describe conditions that guarantee that 
a given set {a,,...,a,} of elements of E is a basis for E over F. We 
begin with a few remarks on metric vector spaces. (For more details, see 
Roman, Advanced Linear Algebra, Springer-Verlag, Graduate Texts in 
Mathematics Vol. 135, 1992.) 


Definition Let V be a vector space over a field F. A mapping 
(,):V x VF is called a bilinear form if it is a linear function of each 
coordinate, that is, if for all x,y € V and a,G € F 


(ax + By,z) = a(x,z) + B{y,2) and (z,ax + By) = a(z,x) + B(z,y) 


The pair (V,(,)) is called a metric vector space. A bilinear form is 
symmetric if (x,y) = (y,x) for all x, yeE V.0 


If S CV, we let (x,S) = {(x,s) |s € S}. 


Definition A metric vector space is nonsingular if (x,V) = {0} implies 
that x = 0. A metric vector space V is null if (x,y) = 0 for all x, y€ V.0 


If B= {b;} is a basis for V over F and if x = }°x;b; € V, we will 
denote the coordinate (row) matrix (x,,...,x,) by the boldface notation 
x. The matrix of the form (,) with respect to B is 
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Mg = ((b;;b;)) 


Here are some key facts about the matrix of a form. We leave proof 
to the reader. 


Theorem 7.2.1 
1) If Mg is the matrix of a bilinear form on V then 


(x,y) = xMay" 
for all x, y EV. 

2) Two matrices M and N represent the same bilinear forms on V, 
with respect to possibly different bases, if and only if they are 
congruent, that is, if and only if M= PNP" for some invertible 
matrix P. 

3) A metric vector space is nonsingular if and only if any, and hence 
all, of the matrices that represent the form are nonsingular. 0 


Now we can return to the business at hand. Let F < E be a finite 
extension and let 


(7.2.1) (a,8) = Trgjp(@P) 


for all a, 8 € E. This is easily verified to be a symmetric bilinear form 
on E over F. If B = {(,,...,8,} is a basis for E over F, then the matrix 
of the form (,) is 


Mg = ((8;,8;)) a (Trg /p(4;4;)) 


This form has rather special properties, due to the fact that 


(ya,8) = (a,78) 
for all a, 8, yEE. 
Theorem 7.2.2 Let F <E be finite, with form given by (7.2.1). Then 
either 


1) Eis null and the trace map is identically zero, or 
2) Eis nonsingular and every matrix representing (,) is nonsingular. 


Proof. If E is singular then (a,E) = 0 for some a #0 and so (1,E) = 
{0}. It follows that (,) is null and the trace map is identically zero. i 


Thus, any matrix representing the form (7.2.1) is either the zero 
matrix or it is nonsingular. Note that, if char(F) = p #0, then the zero 
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matrix will arise when p | Trg; y(@) for all a € E. Referring to part 3) of 
Theorem 7.1.2, we see that tiie happens when [E:F]; > 1, since [E: F]; is 
a power of p. In other words, if F < E is not separable, then E is null. 
The converse also holds. 


Theorem 7.2.3 Let F<E be finite, with form (7.2.1). Then E is 
nonsingular if and only if F < E is separable. 


Proof. We have just seen that if F <E is not separable then E is 
singular. For. the converse, suppose that F < E is finite and separable. 
Then there exists a primitive element a & E. If E = F(a) has degree n 
over F then the elements 1, a,...,a°~! form a basis for E over F. 
Referring to part 3) of Theorem 7.1.2, and letting a; = o,a be the roots 
of min(a,F) and of = (ak,...,ak), we have 


Trgjp(o*as) = J o;(akal) = Sakai = oX(al)" 
i=1 


i=1 


Thus, if V is the Vandermonde matrix 


a 1 1 1 
1 
a a, a Qy-4 
V=| a |= af = ad ie 4 
n-1 -1 -1 -1 
a at art ann 
then 
k jy) — T 
[Trp pro a@)| = VV 
and so 


det(Trgp(aXo#)) = (det V)? 


It is well-known that 
det V = |] (a;— a) 
i<j 
Since a is separable, the a;,’s, being the roots of the separable 


polynomial min(a,F), are distinct and so det V #0. Hence (,) is 
nonsingular. 
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In view of the previous results, the trace map and the form (7.2.1) 
are interesting only when the form is nonsingular, that is, only when 
F < E is separable. 


Definition Let F < E be finite and let Q,,..-,@, € E. The discriminant 
of a,,...,@, is the determinant 


Agr (O01) = | (a5105) | = | Trp pp(aio5) | 


Thus, if a,,...,a, is a basis for E over F then the discriminant is the 
determinant of the matrix that represents the form (7.2.1) with respect 
to this basis. 0 


When F < E is finite and separable, the discriminant can be used to 
determine whether or not a set of vectors is a basis for E over F. 


Theorem 7.2.4If F<E is finite and separable of degree n, then 
{a,,...,@,} is a basis for E over F if and only if Agyjpl@is-++1%) #0. 


Proof. Since E is nonsingular, if {a,,...,@,} is a basis for E over F, 
then ((a;,a;)) is nonsingular and so Agp(Qq)+++1%n) # 0. Conversely, 
assume that Appl: ..,@,) # 0 and that 


aia; =0 
i 
for a; € F. Multiplying by a, and taking the trace gives 
i 


and since the rows of the matrix (Trp jr(%53)) are linearly independent, 
we have a; = 0 for all i, whence {a,,...,@,} 1s a basis for E over F. 


We next derive an alternate expression for the discriminant. Let 
F <E be finite and separable and let a,,...,a, € E. Let Hom(E,F) = 
{o,,...,0,,} and consider the matrix 


Oy O70, i O,% 


o,a Oo wes 0,a@ 
(7.2.2) M(qy).-.0,)=| 17 7? ? 
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If A = M(ay,...,@,,) and B= M(f,,...,8,) is the corresponding matrix 
for B,,...,8, € E then the (i,j)-th entry of AB’ is 


(AB"); = 3 04.004.8; = D> oy(08;) = Trp sp(%F)) 
and so ' : 


M944) M(Byp- 4g)" = (Trp ay8) 


In particular, if @; = a; for all i, then 


M(04,--+)Q,)M(a4,.--;0,) = (Trg pp(o;05)) 
Taking determinants gives the following. 


Theorem 7.2.5 Let F < E be finite and let a,,...,a, € E. Then 


ApysplOys- +++) = | M(ay)---1@q) |? 


Thus, {a,,...,@,,} is a basis for E over F if and only if 


|M(a,,..-,a,) | #0 0 


*7.3 Algebraic Independence of Embeddings 


Let E and L be fields. Recall that the Dedekind Independence 
Theorem (Corollary 2.8.7) says that any set {c,,...,0,,} of distinct 
embeddings of E into L is linearly independent over L. To put this 
another way, let A; €L. and consider the polynomial p(x,,...,x,) = 
>> ;x;. Then the Dedekind Independence Theorem says that if 
p(o,,...,0,) is the zero map, then p(x,,...,x,) must be the zero 
polynomial. Under certain circumstances, we can strengthen this result 
considerably. 

If o,,...,0, are embeddings of E into L and if p(x,,...,x,) is a 
polynomial with coefficients in L then p(oj,...,¢,,) is a function from E 
into L, defined by 


P(01,...,0,)4 = p(o4a,...,0,,a) 


Note that we are dealing here with the product of maps, and not the 
composition. Thus, for instance, if n = 1 and p(x) = x, we have 


p(o)a = p(oa) = (ca)? 
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and not p(a)a = 0 


a =o(ca). 

Definition Let F < E. A set of distinct F-embeddings {c,,...,0,,} of E 
into a field L is algebraically independent over L if the only polynomial 
P(X,,---)X,) € L[x,,...,X,] for which p(o,,...,0,,) is the zero function is 
the zero polynomial. 0 


Theorem 7.3.1 Let F be an infinite field, let F<E be finite and 
separable of degree n. Then any set {o,,...,0,} of distinct F- 
embeddings of E into any field L is algebraically independent over L. 


Proof. Suppose that p(x,,...,x,) is a polynomial over L for which 
P(o,,---,0,)@ = 0 for all a EE. Let {a;} be a basis for E over F. Then 
for all a; € F, we have 


p(o, » ajQi,.0., Cas a;o;) = p( 2a %, ony a) =0 
i i i i 
This implies that the polynomial 
A(X 19++ +9 Xp) = PCD XO As.) DX} 
i i 


over L satisfies q(a,,...,a,,) = 0 for all a; € F. It follows from Theorem 
1.3.4 that q(x,,...,x,) = 0, that is, 


P( 32 XjO105,---) 2 Xj0,05) = 0 


Now, the matrix M(qj,...,a,) = (o;a;) is nonsingular by Theorem 7.2.5 
and so for any (,...,8, € L, there exists x,,...,x,, € E such that 


By = 0x0 10; 5-6) By = DO X0,% 


Hence p((},...,8,,) = 0 for all @; € L, implying that p(x,,...,x,) =0. 


*7.4 The Normal Basis Theorem 


Let F <E be a finite Galois extension of degree n. Since F < E is 
finite and separable, there exists a A€ E such that E = F(A). As we 
know, the set {1,\,...,A"~!} is a basis for E over F. This type of basis 
is called a polynomial basis. If Gp(E) = {0,,...,0,} then the elements 
0,A,...)0,A are precisely the roots of min(\,F) and so they are distinct. 
If they are linearly independent, then they also form a basis for E over 
F, called a normal basis. Put succinctly, a normal basis is a basis for E 
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over F consisting of the roots of some minimal polynomial min(\,F), for 
AEE. 

We wish to show that any finite Galois extension has a normal basis. 
Theorem 7.2.5 can be reworded for finite Galois extensions as follows. 


Theorem 7.4.1 If F < E is finite and Galois, with Gp(E) = {,,...,0,} 
then {j,...,,} is a basis for E over F if and only if det(o,A;) # 0. 


Proof. We give a proof that does not use the notions of Section 7.2. Let 
o = >) 8,0; for B; € E. Since distinct F-automorphisms of E are linearly 
independent over E, it follows — o = 0 if and only if 6; =0 for all i. 
Now suppose that B = {Aj).-- d,} is a basis for E over F. Then o = 0 if 
and only if od; = = 0 for all j, that i is, if and only if }>, 38,0}; = = 0, for all 
j=1,...,n. It “follows that 189}; = 0 for all j =1,...,n if and only 
if 8; = 0 for all i=1,...,n. Hence, det(o;A;) #0. 
Conversely, suppose that det(o;A i) #0 and let y iP; dj = 0. Then 


j 
for all i=1,...,n. It follows that 6; =0 for all j = 1,...,n and so {A;} 


is a basis for E over F. & 


Theorem 7.4.1 implies that {o;\} is a (normal) basis for E over F if 
and only if det(o;0;) #0. Our goal is to find such an element \€ E, 
when F < E is finite and Galois. 

Consider the matrix 


10, 902 1° 
Ooo 50 O50. 
2° 1 2-2 2°n 
M — . 
O°} 1% On 


For each i, the product ojo; runs through o,,...,0, as j runs through 


1,...,n, and so each row of M is a distinct permutation of o,,...,0,. 
The same applies to the columns of M. Thus, we may write 


oO o oO 

1 lo ly 
oO oO (og 

Mealy eke - 88 2n 
(on oO 
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where (1;,2;,...,n;) is a distinct permutation of {1,...,n}, fori=1,...,n 
and (j,,J9,---:Jn) is a distinct permutation of {1,...,n}, for j =1,...,n. 
Replacing each o; by an independent variable x; gives the matrix 


x xX xX 

1, 1) 1, 

2 X9 Xo 

NGG ei) = A 2 a 
x x tee x 

ny ng Mn 


We claim that the polynomial p(x,,...,x,) = det(N(x,,...,x,)) is 
nonzero. Each row of N is a distinct permutation of the variables 
Xj,--.,X, and similarly for each column. Thus N(1,0,...,0) is a 
permutation matriz, that is, each row and each column of N contains 
one 1 and the rest 0’s. Since permutation matrices are nonsingular 


p(1,0,...,0) = det(N(1,0,...,0)) #0 


Hence, p(x,,...,X,,) #0. 

If F is an infinite field, Theorem 7.3.1 implies that the distinct 
embeddings o,,...,0,, of E into L are algebraically independent over L 
and so there exists a A € L for which 


det(o,0;\) = (det M)(A) = p(oy,..-,0,)A #0 
Thus, we have proven the following. 


Theorem 7.4.2 If F is an infinite field then any finite Galois extension 
F < E has a normal basis. 0 


This result holds for finite fields as well and the proof will be given 
in Chapter 8. 


Exercises 
1. Let F< E be finite. For all a, BEE, 


Trg plot) = Trpjp(@)+T rg p(B), Neyp(@B) = Ngp(o)Ngp() 


2. Let F<E be finite. If a€F then Trg ppl) =[E:Fla and 
_ ,(E:F 
Nayr(@) =a"), 
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3. 


10. 


If F< E <L are finite and if a € L then 


Let F<E be finite and let o € Hom,(E,L). If a€E then 
N og /oF(7®) = o(Np ye(@)): State and prove a similar statement 
for the trace. 
Find a normal basis for the splitting field of p(x) = x4 —5x7+6 
over Q. 
If F is a finite field of characteristic 2 show that every element of 
F has a square root in F. 
If F is a finite field of characteristic p #2 then exactly half the 
nonzero elements of F have square roots in F and that if a € F has 
a square root in F then the set of all squares in F is {87a | 6 € F}. 
Let F<E be a finite separable extension, with E = F(a). Let 
p(x) = min(a,F) have degree n. Show that the discriminant 
Agyp(1,0,.--,a"71) is given by Car Ne ye(P')). 
Let F < E be finite and separable with form (7.2.1) and let {a;} be 
a basis for E over F. The dual basis {8;} to {a;} is a basis with 
the property that 

Trg pp(%P}) = (04:83) = 65 
where 6,;;=1 if i=} and 0 otherwise. In matrix terms, {a;} and 
{8;} are dual bases if 


M(ay,---,@,)M(Ay,---58,) =I 


where M is defined by (7.2.2). A basis for E over F is called a 
polynomial basis if it has the form {1, a,...,a"~1} for some a € 
E. Any simple algebraic extension E = F(a) has a polynomial 
basis. Let F <E be finite and separable, with polynomial basis 
{l,a,...,a"~1}, Let 


p(x) = min(a,F) = (x —a)(aq + a,x +++++a,_,x"7") 


Prove that the dual basis for {1,a,...,a"1} is 


ire) nite sat 

Pay Pla)’ pe) 

If V is a vector space, let V* denote the algebraic dual space of all 

linear functionals on V. Note that if dim V is finite then dim V = 

dim V*. 

a) Prove the Riesz Representation Theorem for nonsingular 
metric vector spaces: Let V be a finite dimensional 
nonsingular metric vector space over F and let f€ V* be a 
linear functional on V. Then there exists a unique vector x € 
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b) 
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V such that fx =(y,x) for all ye V. Hint: Let ¥,:V—-F be 
defined by ¥,(y) = (y,x). Define a map r:V-V™ by rx = ¥,. 
Show that 7 is an isomorphism. 

Let F <E be finite and separable, with form (7.2.1). Prove 
that, for any linear functional r:E—F there exists a unique 
a € E for which rf = Trpjp(oF) for all BEE. 


Chapter 8 
Finite Fields I: Basic Properties 


In this chapter and the next, we study finite fields, which play an 
important role in the applications of field theory, especially to coding 
theory, cryptology and combinatorics. For a thorough treatment of 
finite fields, the reader should consult the book Introduction to Fintte 
Fields and Their Applications, by Lid] and Niederreiter, Cambridge 
University Press, 1986. 


8.1 Finite Fields 


If F is a field, then F* will denote the multiplicative group of all 
nonzero elements of F. Let us recall some facts about finite fields that 
have already been established. 


Theorem 8.1.1 Let F be a finite field. 


1) F has prime characteristic. (Theorem 0.3.2) 

2) F* is cyclic. (Corollary 1.3.5) 

3) Any finite extension of F is simple. (Corollary 4.4.5) 

4) F is perfect, and so every algebraic extension of F is separable. 
(Theorem 4.8.2) 0 


Lemma 8.1.2 If F is a finite field and [E:F] =d then JE] = |F|% 


Proof. If {a,,...,a@g} is a basis for E over F, then each element of E has 
a unique representation of the form a,a,+:+:+agag, where a; €F. 


Since there hy | F | possibilities for each coefficient a;, we deduce that 
JE| =|F|°%8 


162 8 Finite Fields I 


Since a finite field F has prime characteristic p, we have Z, <F and 
so Lemma 8.1.2 gives 


Corollary 8.1.3 If F is a finite field with char(F) =p, then F has p™ 
elements for some positive integer n. 0 


From now on, unless otherwise stated, p will represent a prime 
number, and q will represent a power of p. 


8.2 Finite Fields as Splitting Fields 


We have seen that every finite field of characteristic p has p™ 
elements for some n>Q. Let us now show that there is, up to 
isomorphism, exactly one field of size p", for each prime p and each 
integer n > 0. 

Let q = p” and let S be the splitting field for the polynomial 


f(x) = xt -x 


over Z,.. If R is the set of roots of f,(x) in S, then a, @ € R imply that 
a? = q@ and 64= G, whence 


(at B)t=a9+ BI=a+fB and (aB~!)1= 08%"! = ap} 


Hence a+ BE R and af~' ER. It follows that R is a field and R=S. 
Furthermore, since 


' ts bs ose 
fy (x) = ax 1=-1 


the polynomial f,(x) has no multiple roots in S and so {S| =q. Thus, 
there exists a finite field S of size q =p" for every prime p and every 
positive integer n. It is customary to denote such a field by Fy or 
GF(q). (The symbol GF stands for Galois Field, in honor of Evariste 
Galois.) 

To establish uniqueness, observe that if F is a field of size q =p", 
then F* is a multiplicative group of order q—1 and so every a € F* 
satisfies at-!=1. Thus, every a€F is a root of the polynomial 
f,(x) = x4—x. Since this polynomial has exactly q roots, F is the set of 
roots of f,(x) and is therefore the splitting field for f,(x) over Z,. Since 
any two splitting fields for f. (x) are isomorphic, we repnclude that any 
two finite fields of size q are izomorphie. 
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Theorem 8.2.1 

1) Every finite field has size q=p", for some prime p and integer 
n>0. 

2) For every q=p™ there is, up to isomorphism, a unique field 
GF(q) of size q, which is both the set of roots of f,(x) =x4—x 
and the splitting field for f,(x) over Z,. 0 


In view of this theorem, we will often refer to the finite field GF(q). 


Corollary 8.2.2 The extension GF(q) < GF(q") is a Galois extension. 0 


8.3 The Subfields of a Finite Field 


It is easy to determine the subfields a finite field. If F < GF(p™) 
then Lemma 8.1.2 implies that |F| = p% for some d|n. On the other 
hand, we have 


d[n => p?-1[p"-1 > xP _ yj ye 1 > fa) | fyo(x) 


and since f,n(x) splits over GF(p"), so does f 4(x). Thus GF(p") contains 
a splitting field for f 4(x), that is, GF(p™) contains a subfield of size p%. 
Certainly, GF(p") ednnot coutain more than one such subfield, for then 
there would be more than p® roots of the polynomial f A(X) 3 in GF(p”). 


Theorem 8.3.1 The field GF(p") has exactly one subfield of size p%, for 
each d|n. This accounts for all of the subfields of GF(p"). 0 


8.4 The Multiplicative Structure of a Finite Field 
Since GF(q)* is cyclic, Theorem 0.2.11 implies the following theorem. 


Theorem 8.4.1 There are exactly ¢(d) elements of GF(q)* of order d for 
each d|q—1 and this accounts for all of the elements of GF(q)*. 0 


It is customary to refer to any element of GF(q) that generates the 
cyclic group GF(q)* as a primitive element of GF(q). However, this 
brings us into conflict with the term primitive as used earlier to denote 
any element of a field that generates the field using both field operations 
(addition and multiplication). Accordingly, we adopt the following 
definition. 
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Definition Any element of GF(q) that generates the cyclic group GF(q)* 
is called a group primitive element of GF(q). In contrast, if F < E then 
any element a€E for which E=F (qa) is called a field primitive 
element of E over F. 0 


If 8 € GF(q), we may wish to know when the equation 
(8.4.1) xK = 8 


has a solution in GF(q), that is, when @ has a k-th root in GF(q). This 
question has a simple answer in view of the fact that GF(q)* is cyclic. If 
@ is a group primitive element of GF(q) then § = a’ for some i and so 
(8.4.1) has a solution x = o? if and only if 


oki = oi 
for some integer j. This is equivalent to 


kj =i mod (q-1) 
or 
i =kj + n(q-1) 


for some integers n and j. But this holds if and only if 


(k,q—- 1) li 


where (k,q—1) is the greatest common divisor of k and q—1. Thus, 
equation (8.4.1) has a solution for all @€ GF(q) if and only if 
(k,q—1) =1. 


Theorem 8.4.2 


1) Let @ be a group primitive element of GF(q). The equation xk = 
a' has a solution in GF(q) if and only if (k,q—1) |i. 

2) The equation x* = @ has a solution for all 6 € GF(q) if and only if 
(k,q — 1) = 1, in which case the solution is unique. 0 

Theorem 8.4.2 says that if (k,q—1)=1, the function arak is a 

permutation of the elements of GF(q). For this reason, in this case the 
polynomial p(x) = x* is called a permutation polynomial. 
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8.5 The Galois Group of a Finite Field 


Since the extension GF(q) < GF(q") is Galois, if G is the Galois 
group of GF(q") over GF(q) then 


|G| =[GF(q"):GF(q)] =n 
The structure of G could not be simpler, as we now show. 


Theorem 8.5.1 The Galois group G of GF(q") over GF(q) is cyclic of 
order n, generated by the Frébenius automorphism gyal. 


Proof. Since a4 = a for all a € GF(q), we have 7, € G. Moreover, the n 
automorphisms 
n-1 


2 
by Og Ogres OQ 


are distinct elements of G, for if o* =. then a* = a for all a€ GF(q*), 
which implies that k >n. Since |G| =n, we see that G = (c,). 


8.6 Irreducible Polynomials over Finite Fields 


The following theorem. gives some key facts about irreducible 
polynomials over a finite field. 


Theorem 8.6.1 For every finite field GF(q), and every positive integer d, 
there exists an irreducible polynomial p(x) of degree d over GF(q). Let 
a be a root of p(x) in some extension field. 


1) (Splitting Field) The splitting field of p(x) is GF(q)(a) = GF(q°). 
2) (Roots) The roots of p(x) in a splitting field are 

2 d- 

(8.6.1) a, aFat ,...,a9 , 


3) (Degree) d is the smallest positive integer for which at =a. 


4) (Degree) p(x) |x? —x if and only if d|k. Hence, d is the smallest 
positive integer for which p(x) | x? —x. 


5) (Order of Roots) All roots of p(x) have the same multiplicative 
order in GF(q4)*. 


Proof. Note first that since GF(q)< GF(q9) is simple, we have 
GF(q*) = GF(q)(B) and so min(#,GF(q)) is an irreducible polynomial 
of degree d over GF(q). For part 1), since GF(q) < GF(q)(q) is normal, 
p(x) splits in GF(q)(a), whence it is the splitting field for p(x). Also, 
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[GF(q)(«): GF(q)] = deg p(x) = 


and so GF(q)(a) = GF(q9). 
To prove part 2), recall that the Galois group of GF(q?) over GF(q) 
is the cyclic group 


(oJ ={4 Ty tees faa 


Applying these maps to a gives the complete list (8.6. 1) of roots of 
P(x); with no duplicates since any automorphism of GF(q4 ) over GF(q) 
is completely determined by its dalue on a. 

For part 3), since a € GF(q) we have a* =a and it is clear from 
part 2) that no smaller power of, q can have this property. Part 4) 
follows from the fact that p(x) |x —x if and only if the splitting field 
for P(x) i is a subfield of the splitting field for xt —x, that is, if and only 
if GF(q4) < GF(q*). Part 5) follows from the fact that since 0, is an 
automorphism of GF(q°), it preserves multiplicative order and so the 
order of oka is equal to the order of a. 


Definition If p(x) is irreducible over GF(q) then the multiplicative order 
of any root of p(x) in its splitting field is called the order of p(x) and is 
denoted by o(p(x)) or o(p). 0 


Definition A polynomial P(x), over GF(q) of degree d is said to be 
primitive over GF(q) if it is the minimal polynomial of a group 
primitive element of GF(q°), that is, if its order is q4—1.0 


According to part 5) of Theorem 8.6.1, an irreducible polynomial 
over GF(q) of degree d is primitive if and only if all of its roots are 
group primitive in GF(q4 ). Primitive polynomials play an important 
role in finite field arithmetic, as we shall see in the next chapter. 

The following theorem provides a characterization of order. (cf. 
Theorem 8.6.1, part 4).) 


Theorem 8.6.2 Let p(x) € GF(q) be irreducible of order v. Then 
p(x) | xk —1 if and only if v|k. Hence, v is the smallest positive integer 
for which p(x) |x” —1. 


Proof. Suppose first that y|k. Each root a of p(x) satisfies a” -1=0 
and wherefore also a —1. Since p(x) is separable, we conclude that 
p(x) | x — 1. Conversely, if p(x) |x*—1 then any root of p(x) is a root 
of x* — 1 and therefore has order dividing k, whence v|k. § 
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Relationship Between Order and Degree 


There is a simple relationship between the order and degree of an 
irreducible polynomial p(x) over GF\q). Let o(p(x))=v and 
deg p(x) =d and suppose that a € GF(q3) i is a root of p(x). Since 


a =a if and only if v|q*-1 


and since d is the smallest positive integer for which af =a, we 
conclude that d is the smallest positive integer for which v | q@ —1. Put 
another way, d = deg p(x) is the order of q modulo v, written 0,(q). 
Since (v,q)=1, the residue q of q modulo vy lies in Z¥, the 
multiplicative group of elements of Z,, that are relatively prime to v 
and so deg p(x) = 0(q) in the group Z¥. 

By way of converse, suppose that f(x) is a polynomial over GF(q) 
and @ is a root of order v in a splitting field. If deg f(x) = 0,(q) then 
f(x) must be irreducible, since it has the same degree as p(x) = 
min(a,GF(q)) and is divisible by p(x). 


Theorem 8.6.3 Let p(x) be a polynomial over GF(q) of degree d, let a 
be a root of p(x) of order v in a splitting field. Then p(x) is irreducible 
if and only if any of the following equivalent conditions holds. 


1) dis the smallest. positive integer for which v | q4 —1. 
2) dis the smallest positive integer for which at =a. 


3) d= o0,(q) is the order of q modulo v. 0 


Theorem 8.6.3 tells us that the degree of an irreducible polynomial is 
completely determined by its order. It is not true that the order of an 
irreducible polynomial is determined by its degree, as we will see in a 
moment. 

If p(x) € GF(q) is irreducible and has degree d, then GF(q®) is the 
splitting field for p(x). Of course, we may view p(x) as a polynomial 
over any intermediate field GF(q*) where 1 <k<d, in which case it 
may ae longer be irreducible. Let a be a root of p(x) of order v in 
GF(q4 ), and suppone Bu a is a root of the irreducible factor q(x) of 
p(x) over GF(q*). Since q* has order 6 = d/(k,d) in Z*,, we deduce from 
Theorem 8.6.3 that deg q(x) = 6. 


Theorem 8.6.4 Let p(x) be irreducible of degree d over GF(q). When 
thought of as a polynomial over GF(q* ), where 1<k<d, the 
polynomial p(x) can be factored into (k,d) irreducible factors, each of 
which has degree d/(k,d). In particular, p(x) is irreducible over GF(q*) 
if and only if (k,d) =1.0 
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Computing the Order of a Polynomial 


We now present a procedure for finding the order v of an irreducible 
polynomial f(x) of deere d. Let p be a prime sdisading q? —1 and 
suppose that q’ —1=p‘u with pfu and v=p‘v with p fv. Since 
y|q4 —1 we have s<t and p ae ~ 1 if and only if w<t—s. _ 
the largest w for which p“» | q? — 1 satisfies w = t —s, that is, s =t— 
Thus, the largest value of w for which 


Ww 


d_ 
y |S 
or equivalently by Theorem 8.6.2, 


d 
f(x) | x(4 -1)/p™ = 


t-—w 


gives the largest power p of p dividing v. Doing this for all primes 
dividing q4— 1 gives the value of v. 


Example 8.6.1 Consider the irreducible polynomial f(x) =x°+x+1 
over GF(2). Since q = 2, we have 


q® —1=63 = 37-7 
Let p = 3. Division shows that 


f(x) {x®9/9 1, f(x) fx®/3 —1, f(x) | x8 - 


and sow=0, s=t—w=2-0 = 2, whence 3? is the largest power of 3 
dividing v. For p = 7 division gives - 


f(x) {x®/7_—1, (x) [x8 - 
and so 7 is the largest power of 7 dividing v. Thus v = 37-7 = 63 
showing that f(x) is Pairs. Oe GF (2). 
The polynomial g(x) = x®°+x4+x?+x+41 is also irreducible over 
GF(2). In this case for p = 3 we have 
a(x) {x°9/9 —1, f(x) [x99 —1 
and so w = 1, whence 3|v but 3?/v. For p =7 we have 


f(x) {x99/7 —1, £(x) | x8 


hence 7|v and v = 21. Note that both of these polynomials have degree 
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6 but they have different orders. This shows that the degree of an 
irreducible polynomial does not determine its order. 0 


*8.7 Normal Bases 


We saw in Chapter 7 that if F < E is a finite Galois extension and F 
is an infinite field, then E has a normal basis over F. To prove an 
analogous theorem when E is a finite field, we require a result from 
linear algebra, which we will not prove here. If T:V—-V is a linear 
operator on an n-dimensional vector space V over a field F, then the 
minimal polynomial m-(x) for T is the unique monic polynomial over F 
of smallest degree for which m7;(T)=0. Since T satisfies its 
characteristic polynomial c(x) = det(xI— Mat(T)), we have 
m-y(x) | e-p(x). A vector v € V is said to be cyclic for T if the vectors 


{v,Tv,T2v,...,T"-4v} 


form a basis for V. Here is the result that we need. 


Theorem 8.7.1 Let T:V—V be a linear operator on a finite-dimensional 
vector space V over a field F. Then V contains a cyclic vector for T if 
and only if the minimal polynomial m7(x) and the characteristic 
polynomial c(x) are the same. 0 


Now we can establish the existence of normal bases for finite fields. 


-1 
Theorem 8.7.2 There exists a normal basis {a, a4,...,a7" } for 
GF(q") over GF(q). 


Proof. If n= 1, there is nothing to prove, so assume that n> 1. The 
Galois group of GF(q") over GF(q) is 


G= {1,074,072 Aisa ont} 


where o,:a—a%. By the Dedekind Independence Theorem, these maps 
are linearly independent over GF(q"). Thus, thinking of o, as a linear 
operator on the n-dimensional vector space GF(q") over GF(q), we see 
that o,, satisfies the polynomial x" — 1 and no polynomial over GF(q) of 
-smaller degree. Hence x"—1 is the minimal polynomial of 7, over 
GF(q). On the other hand, the characteristic polynomial of o, has 
degree n, is monic, and is divisible by x"—1, and so it must also be 
x"—1. By the previous theorem, there exists a cyclic vector a for o 
and so 


q 


n-1 
qa q 


is a normal basis for GF(q") over GF(q). & 


Q, T,M,...,00 a 
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*8.8 The Algebraic Closure of a Finite Field 


In this section, we determine the algebraic closure of a finite field 
GF(q). Since GF(q) < GF(q®) is algebraic for all positive integers n, an 
algebraic closure of GF(q) must contain all of the fields GF(q"). Since 
n! | (n+1)!, it follows that 


GF(q"") < GF(q®t)') 
and so the union 


ra) = (J ert") 


n=0 


is an extension field of GF(q) that contains GF(q™), for all n > 1. 


Theorem 8.8.1 The field I'(q) is the algebraic closure of GF(q). 


Proof. Every element of I(q) lies in some GF(q™), whence it is 
algebraic over GF(q). Thus I'(q) is algebraic over GF(q). Now suppose 
that I'(q) < E is algebraic and let a € E have minimal polynomial p(x) 
over GF(q). If deg p(x) =d, then p(x) splits in G¥F(q4), which is 
contained in I'(q). Hence a €I'(q) and so E <I(q). Thus, ['(q) has no 
proper algebraic extensions. §f 


Steinitz Numbers 


We wish now to describe the subfields of the algebraic closure '(q a 
Recall that a field K is a subfield of GF(q™) if and only if K = GF(q°) 
where d|n. The set Nt of positive integers is a complete lattice where 
m An = gcd(m,n) and m Vn = Icm(m,n). If we denote by %, the set of 
all finite fields (or more properly the set of all isomorphism classes of 
finite fields) that contain GF(q), then F, is also a complete lattice 
where EAF = ENF and EVF = EF. 


Theorem 8.8.2 The map ¥:Nt—¥, defined by ¥(n) = GF(q") is an 
order-preserving bijection. Hence, it is an isomorphism of lattices, that 
is, 


1) nm if and only if GF(q") < GF(q™), 
2) GF(q") Nn GF(a™) = GF(an™), 

3) GF(q")GF(q™) = GF(q?Y™). 

Proof. Left to the reader. I 
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It is clear that the lattice of intermediate fields between GF(q) and 
GF(q") is isomorphic to the sublattice of N* consisting of all positive 
integers dividing n. In order to describe the lattice of intermediate fields 
between GF(q) and I'(q), we make the following definition. 


Definition A Steinitz number is an expression of the form 
fe) " 
S= Il P;! 
i=1 


where p, is the i-th prime and e; € {0,1,2,... } U {oo}. We denote the set 
of all Steinitz numbers by S. Two Steinitz numbers are equal if and 
only if the exponents of corresponding prime numbers p, are equal. 1] 


We will denote arbitrary Steinitz numbers using upper case letters 
and reserve lower case letters strictly for ordinary positive integers. We 
will take certain obvious liberties when writing Steinitz numbers, such 
as omitting factors with a 0 exponent. Thus, any positive integer is a 
Steinitz number. We next define the algebra of Steinitz numbers. 


Definition Let S = [| p;i and T = Tips be Steinitz numbers. 
1) The product and quotient of S and T are defined by 
ry et ry _e.—f 
ST= [pi i and S/T= [Lp i 
i=1 i=1 


where oo — co = 0. 


2) We say that S divides T and write S| T if e; < f, for all i. 0 


Theorem 8.8.3 Under the relation of “divides” given in the previous 
definition, the set S is a complete distributive lattice, with meet and 
join given by 


ry min(e,,f,) os rt max(e:,f,) 
SAT= Ile: mi and SvT= [I[p; mi 
i=1 i=1 


Moreover, the set of positive integers is a sublattice of S. 0 


Subfields of the Algebraic Closure 


We can now describe the subfields of I'(q). Let #([(q)) denote the 
lattice of all subfields of ['(q) that contain GF(q). 
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Definition If S is a Steinitz number, let 


GF(qS) = LU GF(a°) 
a[s 


where, as indicated by the notation, d is an ordinary positive integer. O 


If a, 6 € GF(q°) then a € GF(q*) for some k |S and 8 € GF(q") for 
some n|S. Thus a, @ € GF(q™) where m =Icm(k,n). It follows that 
GF(q§) is a subfield of I'(q) containing GF(q). 


Theorem 8.8.4 The map #:S—¥(I'(q)) defined by ¥(S) = GF(q°) is an 
order preserving bijection. Hence, it is an isomorphism of lattices, that 
is, 


1) S|T if and only if GF(q5) < GF(q°), 

2)  GF(q°)n GF(q") = GF(q°**), 

3) GF(q8)GF(q") = GF(a° Y"). 

In addition, GF(q°) is finite if and only if S is a positive integer. 


Proof. We begin by showing that n|S$ if and only if GF(q") < GF(q°). 
One direction follows immediately from the definition: if n|S then 
GF(q") < GF(q5). Suppose that GF(q") < GF(q5). Let a be a field 
primitive element of GF(q") over GF(q). Then a € GF(q§) and so a € 
GF(q*) for some d|S. Hence GF(q") = GF(q)(a) < GF(q3), which 
implies that n|d, whence n |S. 

To see that 7 is injective, suppose that S # T. We may assume that 
there exists an integer n>1 such that n|S but n{T. Then 
GF(q") < GF(q8) but GF(q") ¢ GF(q") and so GF(q5) # GF(q"). 

To see that w is surjective, let GF(q) < F <I'(q). We must find an S 
for which GF(q°) =F. For each prime p,, let e; be the largest power of 
p; for which 


e. 
(8.8.1) GFqPi) <F 


where e; = co if (8.8.1) holds for all positive integers e;. Let 


CO 
s= [lv 


i=1 


mio. 
d= [[ pi 


i=1 


If d|S then 


for some m € Nt, where f, < e; and f; < oo. Hence 
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fj ej 
GF(qPi) < GF(qhi) <F 
for i= 1,...,m and so 


fj 
GF(q9) = V GF(qPi) <F 


It follows that GF(q5) <F. Now, if a€F then a€ GF(q") < F for 
some n. If 


then ” 
GF(q?i) < GF(q") <F 


and so g; <e; for all i, by the maximality of e;. Hence n|S and so a € 
GF(q") < GF(q5). This shows that F < GF(q5). Hence F = GF(q°) and 
so w is surjective. We leave the rest of the proof to the reader. &f 


Exercises 
1. Show that 
d[n = p?—1|p"-1 => f(x) | fool) 


2. IsZ,xF,? Is Z,%F,? When is Zi Fa? 

3. Determine the number of subfields of Fj 99,. Determine the 
number of subfields of Fy. 

4. Show that, except for the case of F,, the sum of all of the 
elements in a finite field is equal to 0. 

5. Find all group primitive elements of F,. 

6. Show that the polynomial x*+x?+4x?+4+x+1 is irreducible over 
F,. Is it primitive? 

7. Let F be an arbitrary field. Prove that if F* is cyclic then F must 
be a finite field. 

8. Consider the irreducible polynomial p(x) = x*—2 over Q. Show 
that adjoining one root of p(x) to Q does not produce the splitting 
field for p(x). What is the degree of the splitting field for p(x) 
over Q? 


Find the order of the following irreducible polynomials. 


9. xt4x34x?4x41 over GF(2). 
10. x4+x+1 over GF(2). 

11. x84+xt4x3+4x241 over GF(2). 
12, x84+x°4x44x3+41 over GF(2). 
13. x84x74x°4x41 over GF(2). 
14. x4+x+42 over GF(3). 
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x?4x34x741 over GF(3). 

x°—x+1 over GF(3). ; 

Show that every element in GF(q") has a unique q'-th root, for 
i=l,....n—1. 

If 2/q, show that exactly one-half of the nonzero elements of 
GF(q) have square roots. Hint. Let @ be a primitive element of 
GF(q). If 8 = 7, then a?* = « for some k. 

Show that if a € GF(q) and n is a positive integer, then x1-x+a 
divides x" —x-+na. Hint: show that roots of the former are roots 
of the latter. 

Find a normal basis for GF(8) over GF(2). Hint. Let a be a root 
of the irreducible polynomial x? + x? + 1. 

Show that ['(q) = U2 5GF(aq"). 

Show that I'(q") = T(q™). 

Let F be a field F satisfying GF(q) < F <I(q). Show that all of 
the proper subfields of F are finite if and only if F is finite or F = 
GF(q>) where S = p® for some prime p. 

Show that I'(q) has no maximal subfields. 

Show that [I'(q):F] is not finite for any proper subfield F < I'(q). 
Show that [(q) has an uncountable number of nonisomorphic 
subfields. 

Let S| T. Show that [GF(q!):GF(q5)] is finite if and only if T/S is 
finite, in which case the two numbers are equal. Hint: consider the 
intermediate fields. 


Chapter 9 
Finite Fields II: Additional Properties 


9.1 Finite Field Arithmetic 


There are several ways in which to represent the elements of a finite 
field. One way is to use a factor ring GF(q)[x]/(p(x)), where p(x) is 
irreducible. Another is to use the fact that GF(q)* is cyclic, and so its 
elements are all powers of a group primitive element. It is clear that 
addition is more easily performed when field elements are written as 
polynomials and multiplication is more easily performed when all 
elements are written as a power of a single group primitive element. 
Fortunately, the two methods can be combined to provide an effective 
means for doing finite field arithmetic. 


Example 9.1.1 Consider the finite field GF(16) as an extension of 
GF(2). The polynomial 
p(x) =x4+x+1 


is irreducible over GF(2). To see this, note that if p(x) is reducible, it 
must have either a linear or a quadratic factor. But since p(0) #0 and 
p(1) #0, it has no linear factors. To see that p(x) has no quadratic 
factors, note that there are precisely four quadratic polynomials over 
GF(2), namely, 


x’, re oe Poe x24x4+1 


and it is easy to check that no product of any two of these polynomials 
equals p(x). Since deg p(x) = 4, we have 
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GF(2)[x] 


wy 


Thus, letting a be a root of p(x), we can represent the elements of 
GF(16) as the 16 binary polynomials of degree 3 or less in a: 
Constant: 0,1, 
Linear: a,at+1 
Quadratic: a’, a? +1, a +a,a7+a41 
Cubic: o, oF +1, a +a, a +07, a +041, 
ao + a7 +1, +a? +a, e+ar*+atl 


Addition of elements of GF(16) is quite simple, since it is just 
addition of polynomials, but multiplication requires reduction modulo 
p(q), that is, using the relation a* = a+1. On the other hand, observe 
that 


al = (a5)3 = (a-a4)® = (a- (at) = a%(a+1)® 

=a?.(eF+a7+a41)=a%+a0°+a4t+03 

= (a3 +a? )+(a? +a) +(a+1) +03 

= (a3 +07) + (a7 +a)+(a4+1)+a3 =1 
and so o(a)|15. Since a? #1 and a° # 1, we conclude that a is group 
primitive. Hence 

GF(16) = {0,1,a,...,014} 
With this representation, multiplication is all but trivial, but addition 
is cumbersome. 
We can link the two representations of GF(16) by computing a table 


showing how each element a* can be te presented as a polynomial in a 
of degree at most 3. Using the fact that at = 1+, we have 


at=at+l 

ab saat =a(atl)=a' +a 
a® =a-a =a +a 

a =a-c& =at+oe%=a2+a41 


and so on. The complete list, given in Table 9.1.1, is known as a meld 
table for GF(16). As is customary, we write only the exponent k for aX ’ 
and a38,a for the polynomial a,a° + aya? + aja +a. 
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Table 9.1.1 


WOBOanoahL wWNe © 


Computations using this table are quite straightforward; for example, 


(a® + a4 + 1)(a? + a) = (0101 + 0011 + 0001)(1000 + 0010) 


= (0111)(1010) 


= q!9. 99 = 19 = a4 =a41 


Thus, the key to doing arithmetic in a finite field is having a group 
primitive element, along with its minimal (primitive) polynomial. In 
general, the task of finding primitive polynomials is not easy. There are 
various methods that achieve some measure of success in certain cases, 
and we mention one such method at the end of Section 11.2. 
Fortunately, extensive tables of primitive polynomials and field tables 
have been constructed. 

Let us use the primitive polynomial p(x) and the field table for 
GF(16) (Table 9.1.1) to compute the minimal polynomial over GF(2) 
for each element of GF(16). We begin by computing sets of conjugates 
using Theorem 8.6.1 and the fact that a!® = a, 


Conjugates ofa: a, a’, a4, a® 


Conjugates of a: a3, a®, a!?, a4 = 09 


Conjugates of a®: a, a° 


Conjugates of a”: ava", a =a’, a? =a 
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Letting m,(x) be the minimal polynomial for aX, we have, for example 
mg(x) = myo(x) = (x—a)(x— a!) = x? — (aF+a!)x + a8 
The field table for GF(16) gives 
a® + 9 = (0110) + (0111) = (0001) = a° =1 
and since a!5 = 1, we have 
ms(x) = Myo(x) =x? +x+41 
The other minimal polynomials are computed similarly. The complete 
list is 
m,(x) =x+1 
m,(x) = ma(x) = m,(x) = mg(x) = x4+x+1 
ma(x) = mg(x) = mo(x) = myo(x) =x4+x° +x? +x41 
m,(x) = myo(x) = x7 +x+1 
me(x) = my, (x) = my3(x) = my 4(x) = x4 +x° +1 
Being able to factor polynomials of the form x™—1 is important for 
a variety of applications of finite field theory, especially to coding 


theory. Since the roots of x!°~1 over GF(2) are precisely the elements 
of GF(16)*, we have 


x! — 1 = mo(x)m, (x)m(x)m,(x)m7(x) 


Of course, in order to obtain this factorization, we worked in the 
splitting field GF(16). In Chapter 10, we will see how to factor a 
polynomial of the form x"—1 into a product of not necessarily 
irreducible factors, working only within the base field. 0 


*9.2 The Number of Irreducible Polynomials 


Of course, if F is a finite field, then there are only a finite number of 
polynomials of a given degree d over F. It is possible to obtain an 
explicit formula for the number of irreducible polynomials of a degree d 
over GF(q) by using Mébius inversion. (See the appendix for a 
discussion of Mobius inversion.) First, we need the following result. 
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Theorem 9.2.1 Let GF(q) be a finite field, and let n be a positive 
integer. Then the product of all monic irreducible polynomials over 
GF(q), whose degrees divide n, is 


= x"_ 
f(x) = x9 —x 


Proof. According to Theorem 8.6.1, an irreducible polynomial p(x) 
divides f(x) if and only if deg p(x)|n. Hence, f(x) is a product of 
irreducible polynomials whose degrees divide n and every irreducible 
polynomial whose degree divides n divides f, (x). Since no two such 
irreducible polynomials have any roots in common and since fy (x) has 
no multiple roots, the result follows. 


Let us denote the number of monic irreducible polynomials of degree 
d over GF(q) by N,(d). By counting degrees, Theorem 9.2.1 gives the 


following. 


Corollary 9.2.2 For all positive integers d and n, we have 


q’= )> dN,(d) O 


d|[n 


Now we can apply Mobius inversion to get an explicit formula for 
N,(d). Classical Mébius inversion is 


(9.2.1) g(n) = » fd) = f(n) = » a(4)-(5) 
djn d|n 


where the Mébius function p is defined by 


1 if m=1 
p(n) = (-1)k if m= PiP9'':P, for distinct primes p; 
otherwise 


Corollary 9.2.3 The number N q(") of monic irreducible polynomials of 
degree n over GF(q) is 


Na(n) = £ ou@a?=2 Do ua@a4 


d|n d|n 


Proof. Letting g(n)=q" and f(d)=dN,(d) in (9.2.1), we get the 
formula above. & 
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Example 9.2.1 The number of monic irreducible polynomials of degree 
12 over GF(q) is 


Nq(12) = sy(m(1)al? + u(2)° + u(3)q4 + u(4)q? + 1(6)q? + 4(12)a) 
= 1a? - a -a' +4?) 
The number of monic irreducible polynomials of degree 4 over GF(2) is 
No(4) = H{u(1)24 + u(2)2? + u(4)2#) = 3 
as we would expect from the results of Example 9.1.1. 0 


MObius inversion can be used to find not only the number of monic 
irreducible polynomials of degree d over GF(q) but also the product of 
all such polynomials. Let us denote this product by I(q,d;x). Then 
Theorem 9.2.1 is equivalent to 


x? —x = Il I(q,d;x) 
d | n 


Applying the multiplicative version of Mobius inversion gives the 
following. 


Corollary 9.2.4 The product I(q,n;x) of all monic irreducible 
polynomials of degree n over GF(q) is 


n/d d 
I(q,n;x) = TI (*-x)" 14) 2 TI (x*—2)" ) q 
d | n d | n 
Example 9.2.2 For q = 2 and n = 4, we get 
1(2,4;x) = (x16 a x) HOD (x4 = x)H()(,2 i x)H(4) 


16 15 
= XK a Kd 1 94 8 4 qo 
x*—-x x°-1 


*9.3 Polynomial Functions 


Finite fields have the special property that any function from a finite 
field F to itself can be represented by a polynomial. As a matter of fact, 
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this property actually characterizes finite fields from among. all 
commutative rings (finite and infinite)! 

Since GF(q) has size q, there are precisely q? functions from GF(q) 
to itself. Among these functions are the polynomial functions at+p(a) 
where p(x) € GF(q)[x]. We will denote this polynomial function by p(x) 
as well. If p(x) and q(x) are polynomial functions on GF(q) then 
p(x) = q(x) as functions if and only if p(a) =q(qa) for all a € GF(q), 
which holds if and only if 


x4 — x | p(x) — q(x) 


Thus, two polynomials represent the same function if and only if they 
are congruent modulo x?—x. Since every polynomial is congruent 
modulo x?—x to precisely one polynomial of degree less than q 
(namely, its remainder after dividing by x1—x), and since there are q®? 
polynomials of degree less than q, we have the following theorem. 
(Proof of the last statement in part 2 of the theorem is left to the 
reader.) 


Theorem 9.3.1 

1) Two polynomials over GF(q) represent the same polynomial 
function on GF(q) if and only if they are congruent modulo 
xd_x, 

2) Every function f:GF(q)-GF(q) is a polynomial function, for a 
unique polynomial of degree less than q. In fact, the unique 
polynomial of degree less than q that represents f is 


p(x)= 2 f(a)(1-(x-a)9) o 


a € GF(q) 


Note that the representation of f given in part 2) above is the 
Lagrange interpolation formula as applied to finite fields. Part 2) has a 
very interesting converse as well. 


Theorem 9.3.2 If R is a commutative ring and if every function f:R-R 
is a polynomial function, for some p(x) € R[x], then R is a finite field. 


Proof. First, we show that R must be finite. SUPP that [R| =4. 
The number of functions from R to itself is \* and the number of 
polynomials over R is the same as the number of finite sequences with 
elements from R, which is Np. Since oe functions are represented 
by distinct polynomials we oiiiat have \7 <NpA, which only happens 
when 4 is finite. Thus, R is a finite set. 

Now let r, a€ R with r # 0. Define a function f, ,:R—-R by 
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a ifx=r 


fale) = 0 ifx fr 


By hypothesis, there exists a polynomial ay + a,x +-+-+ a,x" for which 


ap tarts tarm=a 
and 


ag tayx+-:++a,x"=0, forxf#r 
Setting x = 0 gives ay) = 0 and so 
r(a, tar +++: t+a,r!) =a 


Thus, we conclude that for any r#0 and any aE R, there isauER 
for which ru = a. In other words, the map y,:R-R defined by y,s = rs 
is surjective. Since R is a finite set, %, must also be injective. Hence, 
ts =0, r#0 implies that s=0 and so R has no zero divisors. In 
addition, since y, is surjective, there exists a u€ R for which ¥,u =r, 
that is, ru =r. If aeé R then aru = ar and since R is commutative and 
has no zero divisors, we may cancel r to get au =a. Thus u€ R is the 
multiplicative identity of R. Hence R is a finite integral domain, that 
is, a finite field. § 


*9.4 Linearized Polynomials 


We now turn to a discussion of linear operators on GF(q") over 
GF(q). We will see that all such linear operators can be expressed as 
polynomial functions of a very special type. 


Definition A polynomial of the form 
-_ i 
L(x) = rs 
i=0 


with coefficients a; € GF(q") is called a linearized polynomial, or a q- 
polynomial, over GF(q"). 0 


The term linearized polynomial comes from the following theorem, 
whose proof is left to the reader. 
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Theorem 9.4.1 Let L(x) be a linearized polynomial over GF(q"). If a, 
8B € GF(q") and a, b € GF(q), then 


L(aa + bf) = aL(a)b + L(B) 


Thus, the polynomial function L(x):GF(q")->GF(q") is a_ linear 
operator on GF(q") over GF(q). 0 


The roots of a q-polynomial in a splitting field have some rather 
special properties, which we give in the next two theorems. 


Theorem 9.4.2 Let L(x) be a nonzero q-polynomial over GF(q"), with 
splitting field GF(q*). Then each root of L(x) in GF(q*) has the same 
multiplicity, which must be either 1 or else a power of q. Furthermore, 
the roots of L(x) form a vector subspace of GF(q*) over GF(q). 


Proof. Since L’(x) = ap, if ag #0 then all roots of L(x) are simple. On 
the other hand, suppose that ag =a, =:::=a,_,=0 but a, 0. 
Then since a, € GF(q"), we have 


and so 


L(x) = y oxt = = SS os xa =(Se aa Ka ai 


i=k i=k 


which is the q th power of a linearized polynomial with nonzero 
constant term, pod therefore only simple roots. Hence, each root of L(x) 
has multiplicity dk. We leave proof of the fact that the roots form a 
vector subspace of GF(q°) to the reader. § 


The following theorem, whose proof we omit, is a sort of converse to 
Theorem 9.4.1. (For a proof of this theorem, and more on q- 
polynomials, see the book by Lidl and Niederreiter (1986).) 


Theorem 9.4.3 Let U be a vector subspace of GF(q") over GF(q). Then 
for any nonnegative integer k, the polynomial 


L(x) = Il (x- a)" 


aéU 
is a q-polynomial over GF(q"). 0 


If L(x) is a q-polynomial, then as a function, we have 
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m i m le 
L:ar+L(a) = > aad = Yo ajeia 


i=0 i=0 


where @, is the Frobenius automorphism. Thus, as an operator 


m _ 
=— 1 
L= 0% 


i=0 


is a linear combination over GF(q") of the automorphisms: ci. Since 
o%G = 4 we may reduce the expression for L to a polynomial in oq of 
degree at most n—1. In fact, adding 0 coefficients if necessary, we cai 


say that every q-polynomial fancuion on GF(q") has the standard form 


n-1 


L= Yo aoi, 


i=0 


for a; € GF(q"). There are q™ n such q-polynomial functions on GF(q"), 
and this happens also to be the number of linear operators on GF(q") 
over GF(q). Moreover, since the maps o’ are linearly independent over 
GF(q"), we deduce that each q-polynomial in standard form represents 
a unique linear operator. Thus, we have characterized the linear 
operators on GF(q") over GF(q). 


Theorem 9.4.4 Every linear operator on GF(q") over GF(q) can be 
represented by a unique q-polynomial in standard form 


n-1 


L(x) = 3 aux? 


i=0 
for some a; € GF(q"). 0 


Exercises 
1. Factor x>—1 over 
(a) F, | (b) Fs 
2. Factor x‘'—1 over 
(a) Fo _ (b) F (c) Fs 
3. Factor x°—1 over 
(a) Fp (b) Fz (c) Fy (d) Fs 
4. Factor x°-—1 over 
(a) F, (b) Fs 
5. Factor x!%—1 over 
(a) F, (b) Fs 
6. Calculate N,(20). 
7. Show that 


N4(2) < x(a" —4) 
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10. 


11. 


and 


[n/2] 
Pio SAN, (4) < <nN,(n) + es q’ <nN,(n) qq tel? 
d|n 
Hence, N,(n) > Aq —q'+*/?), Finally, show that N q() ¥ q"/n. 
Show that the unique polynomial of degree ie than q that 
represents the function f:GF(q)—-GF(q) is 


p(x)= > f(a)(1—-(x-a)*) 


a € GF(q) 


Prove that a linearized polynomial over GF(q") is a linear 
operator on GF(q") over GF(q). 

Prove that the roots of a q-polynomial over GF(q") form a vector 
subspace of the splitting field GF(q*) over GF(q). 

Prove that the greatest common divisor of two q-polynomials over 
GF(q") is a q-polynomial, but the least common multiple need not 
be a q-polynomial. 


Part 3 
The Theory of Binomials 


Chapter 10 
The Roots of Unity 


Polynomials of the form x"—u, where 0#u€F, are known as 
binomials. Even though binomials have a simple form, their study is 
quite involved, as is evidenced by the fact that the Galois group of a 
binomial is often nonabelian. As we will see, an understanding of the 
binomial x" — 1 is key to an understanding of all binomials. 

We can illustrate the interplay between the binomials x"—1 and 
x"—u, for 0 #u€ F as follows. Let E be the splitting field for x"-1 
(with n odd) over F and let S be the splitting field for x" —u over F. It 
is not hard to show that 


F<E<S 


for if r and s are roots of x"—1u then r/s is a root of x" —1. We will see 
in a later chapter that if E=F, that is, if x"—1 splits over F, then 
F <S is abelian (in fact, cyclic). On the other hand, in the opposite 
extreme where [E:F] is as large as possible, then F < § is abelian if and 
only if S = E, that is, if and only if x" —u splits over E. 


10.1 Roots of Unity 


The roots of the binomial x" — 1 over a field F are referred to as the 
n-th roots of unity over F. Throughout this section, we will let F be a 
field with p = expchar(F), S a splitting field for x"—1 over F and U, 
the set of n-th roots of unity over F in S. Notice that if n = kp then 
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x? — 1 = xkP_1 = (x*_1)P 


and so the n-th roots of unity are the same as the k-th roots of unity, 
taken with a higher multiplicity. Thus, from now on, we assume that 
(n,p) =1. 


Theorem 10.1.1 The set U,, of n-th roots of unity over F is a cyclic 
group of order n under multiplication. Moreover, if (m,n) = 1 then 


Unn = Um Up, 


where x is the internal direct product of groups. 


Proof. Clearly a, 6 € U,, implies af, ate U,, Hence, U,, is a subgroup 
of the abelian group S* of nonzero elements of S. Since D(x"—1) = 
nx"! 40, we deduce that x"—1 is separable and therefore has n 
distinct roots, whence |U,| =n. If m<n is the smallest positive 
integer for which a™ = 1 for all a € U,, then all n elements of U,, are 
roots of x™—1, implying that m > n, whence m = n. Thus, the smallest 
exponent of U,, is | U,,| and Theorem 0.2.11 implies that U,, is cyclic. 

For the second part, if a€U,,NU, then a“ =1=a" and since 
(m,n) = 1 there exist a, b € Z such that am + bn = 1, whence 

a= q2amt+bn a q2Mybn =1 

which shows that U,,NU, = {1}. It follows that the mn products in 
the set U,,U,, are distinct. Since U,,U, C U,,, and |U,,U,| =mn= 
| Unn |, we have U_,,, = U,,U,, and thus U,,, = U,xU,-. ¥ 


Definition An element w € U,, of order n, that is, a generator of U,, is 
called a primitive n-th root of unity over F. We shall denote the set of 
all primitive n-th roots of unity over F by Q, and reserve the notation 
w,, for a primitive n-th root of unity. 0 


Note that a primitive n-th root of unity w is a field primitive element 
of S, since F(w) = F(U,) = 8S. However, in general, S has field primitive 
elements that are not primitive n-th roots of unity. 


Theorem 10.1.2 


1) IfweEQ, then Q, = {w*|1<k <n, (n,k) = 1} and [Q,| = ¢(n). 
2) Ifd|n then 29 =Q, /4. 
3) If (n,m) = 1 then QL, = ,,0,- 


Proof. Part 1) follows from the theory of cyclic groups (see Theorem 
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0.2.10). For part 2), if d=n the result is trivial, so assume that d <n. 
If w, € Q, then 


ou) = y= 3 


a so wd EQ n/a" Thus ad an?) n/a: For the reverse inclusion, let @ € 

a Then BEU, and so B =u" for some k, where wE€Q,. Since 
a =n/d, Theorem 0.2.11 implies that k=rd for some r satisfying 
(r,n/d) = 1 and so 8 =u* rd Now, if every prime dividing n also divides 
r, then we would have n/d = 1, contrary to assumption. Hence, we may 
let b=r-+a(n/d), where a> 1 is the product of all primes dividing n 
but not r. Then (b,n) = 1. To see this, suppose that p is a prime and 
p|n. There are two possibilities: (i) if p|r then pj/a and p/(n/d), 
whence pj{a(n/d). Hence, p cannot divide r+a(n/d) =b; (ii) if BA? 
then p|a and so p| a(n/d), and again p cannot divide eae 
Thus, (b,n) = 1 and so w° € Q,,. Finally, 


B = wed = wrdtan = wed = (w>)4 € ad 


For part 3), clearly ww, € Upp: If (wya,)* = 1 then since (m,n) = 
1, we have 


wk = woke UN, = {1} 


m 


and so m|k and n|[k, whence mn|k. Thus o(w,,w,) =mn and 
QQ, C Qyn Since all of the products in U,U,, are distinct, so are all 


mon=+ 


of the arocucts in 2,,0,, and so 
| 2,2 | = $(m)¢(n) = (mn) = |Q,, | 


Hence, 2,0, = Quan 


10.2 Cyclotomic Extensions 


Definition Let F be a field. The splitting field S of x"-1 over F is 
called a cyclotomic extension of order n of F. 0] 


(Cyclotomy is the process of dividing a circle into equal parts, which is 
precisely the effect obtained by plotting the n-th roots of unity over Q 
in the complex plane.) 

To determine the degree of S over F, note that S = F(w,) and so 


[S:F] = deg min(w,,F) 
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Since S is the splitting field of a separable polynomial, it follows that 
F <S is a finite Galois extension and we can get a better handle on its 
degree by looking at the Galois group G,(S). 

Any o € G;(S) is uniquely determined by its value on any wE€Q,, 
and since o preserves order, ow must be one of the ¢(n) primitive roots 
of unity in S, that is, 
ow = wile) 
where k(o) € Z*, the multiplicative group of integers in Z, that are 
relatively prime to n. 

Thus, we may define a map :G,(S)—Z* by 


(10.2.1) go = k(c) 


Since 
(or)w = o(wk()) = (ow)k(7) = yk(o)k(7) 
we have 


bor) = k(o)k(r) = ($o)(yr) 


and so ~ is a homomorphism. Since k(o) = 1 implies that o =., the 
map ? is a monomorphism and thus G,(S) is isomorphic to a subgroup 
of Z*. 


Theorem 10.2.1 If F <S is a cyclotomic extension of order n then Gp(S) 
is isomorphic to a subgroup of Z*. Hence, Gp(S) is abelian and [S:F] 
divides $(n). 0 


Since the structure of Z* is clearly important, we record the following 
theorem, whose proof is left as an exercise. 


Theorem 10.2.2 Let n = [[1,, where the r; = pi are powers of distinct 
prime numbers. Then 


z= IIE, 
i 


Moreover, Z* is cyclic if and only if n = p®, 2p° or 4, where p is an odd 
prime. Q 


Corollary 10.2.3 A cyclotomic extension F <§ is abelian and if n = p*, 
2p* or 4, where p is an odd prime, then F < S is cyclic. 0 


10 The Roots of Unity 193 


Finite Fields 


For finite fields, we can improve upon Theorem 10.2.1. In particular, 
if F = GF(q) is a finite field then S is also a finite field and the Galois 
group G;(S) is cyclic with generator gga", Hence, if w is defined by 
(10.2.1), then Im y is the cyclic subgroup of Z* generated by po, Since 


Pe: eee | 
Tv =W =W 


where q is the residue of q modulo n, we have yo, =q and Imyp= 
(q) < Z* and so ¥:G_(S)—(q) is an isomorphism. In particular, 


[S:F] = | Gp(S) | = o(4) 


Note that we already knew this from Theorem 8.6.3, since 
min(w,GF(q)) has order n, and therefore degree o0(q). 


Theorem 10.2.4 Let S be the splitting field for x"—1 over GF(q), where 
(q,n) = 1. Then 


1) S=GF(q’n), 
2)  Gp(S) = (4) is isomorphic to the cyclic subgroup (q) of Zh. 0 
We should make a remark about the relationship between group 


primitive elements of S and primitive n-th roots of unity. A group 
primitive element @ generates S* 


St = {1, 8, 6...) 
whereas a primitive n-th root of unity w generates U,, 
U, = {1, », iW pied 
If 8 is a group primitive element of S then 0(f) = qt) —1 and so 


q@D_1 


(es) = —_ 
= (k,q"(@-1) 


Since n | q°C9) —1, we may write q°( —1 = nr and so 


ky _ nr 
se (k,nr) 


Hence gx is a primitive n-th root of unity if and only if nr/(k,nr) =n, 


194 10 The Roots of Unity 


that is, if and only if r=(k,nr). But this holds if and only if k = ur 
where (u,n) = 1. 


Theorem 10.2.5 Let @ - a group primitive element of the cyclotomic 
extension F < S. Then /* is a primitive n-th root of unity if and only if 
on(@) _y 


qs 
k= ia | aes 


where 1 <u <n and (u,n) = 1.0 


The General Case 


Returning to the general case, we can at least say some interesting 
things about when the Galois group is isomorphic to Z*. Let w be a 
primitive n-th root of unity over F. Since S = F(w), each o € GF(S) is 
uniquely determined by its value on w and so the elements ow are 
distinct and are the roots of min(w,F). Hence, 


min(w,F) = II (x — ow) 


o € Gp(S) 


Since ow = w for some k € Z* and since Gp(S) i isomorphic to Z? if 
and only if there is a o € G,(8) satisfying ow = w* for every k € z, it 
follows that Gp(S) is isomorphic to Z* if and only if 


min(w,F) = II (x - wk) def Q(x) 


(k,n) =1 


where Q,(x) is the polynomial whose roots are the primitive n-th roots 
of unity in S. Since Q,(w) =0, this holds if and only if Q(x) is 
irreducible over F. The polynomial Q,(x) is called the n-th cyclotomic 
polynomial over F. Note that it is defined only for (n,p) = 1 where p = 
expchar(F). 


Theorem 10.2.6 Let S be the splitting field for x*—1 over F. Then 
G,(S) is isomorphic to Z* if and only if the n-th cyclotomic polynomial 
Q,,(x) is irreducible over F. 0 


Here are some basic facts about cyclotomic polynomials. 


Theorem 10.2.7 Let Q,(x) be the n-th cyclotomic polynomial over F. 
1) deg Q,(x) = ¢(n). 
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2) Q,(x) is monic and has coefficients in the prime subfield of F. 
3) If F = Q then the coefficients of Q,(x) are integers. 
4) The following product formula holds 


(10.2.2) x"—1= J] Q(x) 
d[n 

Proof. Part 1) follows directly from the definition of Q,(x). Part 4) 
follows from the fact that U,, is the disjoint union of Q4 over all d[n 
and Q,(x) has no multiple roots. Hence, the factorizations of both sides 
of (10.2.2) into a product of linear factors are identical. 

Let F’ be the prime field of F. It is clear from the definition that 
Q,,(x) is monic. We prove parts 2) and 3) together by induction on n. 
Since Q,(x) =x-—1, the result is true for n = 1. If p is a prime then 


P_ = = 
Qp(e) = ab awh EPP ptt 


and the result holds for n=p. Assume that 2) and 3) hold for all 
proper divisors of n. Then 


x el= QC) HT Qa(x) = Q,(x)R(x) 
d<n 


By the induction hypothesis, R(x) has coefficients in F’, whence so does 
Q,,(x) = (x - 1)/R(x). Moreover, if F=Q then R(x) has integer 
coefficients and since R(x) is monic, Theorem 1.2.1 implies that Q,(x) 
has integer coefficients. 


Example 10.2.1 Formula (10.2.2) can be used to compute cyclotomic 
polynomials rather readily, starting from the fact that 


Q(x) =x-1 
and 


Qy(x) = xPE + xP $text] 


for p prime. Thus, for example, 


ace. ie 
WW = Rw ENR ET 
xo 


= x®_] = 2h 
Qe(x) = Q1(x)Q2(x)Q3(x) ~ (x — 1)(x + 1)(x? +x +1) Pe Seer 
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and 
x18 _ 


= i = 
HO aaamag)s tee 


This gives us, for instance, the following a factorization of x!5—1 into 
cyclotomic polynomials over GF(2) 


x25] = (x41)(x?-+x41)(xt+x34x?4x4-1)(x8 4x" 4x54x44x54x41) 0 


Part 4) of Theorem 10.2.7 describes a factorization of x" —1 within 
the prime subfield of F (cf. Example 9.1.1). In general, however, this is 
not a prime factorization since Q,(x) is not irreducible. For instance, 
comparing Examples 10.2.1 and 9.1.1 shows that Q,,(x) is reducible 
over GF(2). 

With regard to the irreducibility of cyclotomic polynomials, we do 
have the following important results. 


Theorem 10.2.8 All cyclotomic polynomials Q(x) over Q are 
irreducible over Q. Therefore, GQ(S) ~ Z* and [S:Q] = ¢(n). 


Proof. Suppose that Q,(x) = f(x)g(x), where we may assume (by 
Theorem 1.2.2) that f(x) and g(x) are monic and have integer 
coefficients. Assume that f(x) is irreducible and that w is a root of f(x) 
and hence a primitive n-th root of unity. We claim that wP is also a 
root of f(x), for any prime p/n. For if not then w?, being a primitive n- 
th root of unity, must be a root of g(x). Hence, w is a root of g(x?), 
which implies that f(x) | g(xP) and we can write 


(x?) = h(x)f(2) 


where h(x) is monic and has integer coefficients. Since a? =a mod p, 
for any integer a, we conclude that g(x?) = g(x)P mod p and so, taking 
residues gives 


g(x)? = h(x)f(x) mod p 
If we denote the residue of a polynomial p(x) modulo p by p(x), we get 


B(x)? = h(x)f(x) 


in Z,[x] and so any irreducible factor of f(x) in Z pix] is also a factor of 
(x). This shows that f(x) and g(x) are not relatively prime, and 
therefore have a common root in some extension of Zo However, 
f(x)g(x) = x"-1, which has no multiple roots in any extension. This 
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contradiction implies that wP is a root of f(x). In other words, if w is a 
root of f(x) then so is ow, where o,, is the Frobenius map. 

Now we observe that any primitive n-th root of unity v over Q has 
the form w", for some integer r > 0. Writing r as a product of prime 
numbers, we see that » can be obtained from w by applying a finite 
number of Frobenius maps ¢,, where p is prime. Hence, v is also a root 
of f(x). Thus all roots of Q,(x) are roots of f(x), implying that f(x) = 
Q,,(x), whence Q,(x) is irreducible over Q. § 


Theorem 10.2.9 Let n be an odd positive integer. Then [F(w,):F] = 4(n) 
implies [F(wg):F] = ¢(d) for all d|n. In the language of cyclotomic 
polynomials, if Q(x) is irreducible over F then Q,(x) is irreducible over 
F for all d|n. 


Proof. Let p be a prime dividing n. Since n is odd, p # 2. Let n= pm. 
Then 


(pm) = [F(w,):F(wp)][F(h):F] = ab 


where a = [F(w,):F(wP)] < p and b = [F(w®):F] | ¢(m), since wWP € 2,,. 
If p{m then ¢(pm) = ¢(p)¢(m) = (p — 1)¢(m) and so 


(p — 1)¢(m) = ab 


If a= p, then b = (p—1)¢(m)/p cannot divide ¢(m) since p £ 2. Since 
b < ¢(m), it follows that a = p—1 and b = ¢(m). On the other hand, if 
p|m then ¢(pm) = p¢d(m) and so 


p¢(m) = ab 


whence a= p and b= ¢(m). In either case, b= ¢(m), and since wP = 
Wy Jp We have 


(10.2.3) [F (wp /p):F] = 6) 
Repeated use of (10.2.3) gives the desired result. § 


Let us return briefly to finite fields. If p(x) is monic and irreducible 
over GF(q) and has order v, then each root of p(x) has order v and thus 
p(x) | Q,(x). Since every monic irreducible factor of Q,(x) has order v, 
we conclude that Q,(x) is the product of all monic irreducible 
polynomials of order v. According to Theorem 8.6.3, the degree of any 
such factor p(x) is 0,(q), the order of q modulo v. Hence, the number of 
monic irreducible polynomials of order v_ is ¢(v)/0,(q). 
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Theorem 10.2.10 Let v be a positive integer. 


1) The cyclotomic polynomial Q,,(x) over GF(q) is the product of all 
monic irreducible polynomials of order v over GF(q). 

2) The number of monic irreducible polynomials over GF(q) of order 
v is $(v)/0,(q), where 0,(q) is the order of q mod v. 0 


Equation (10.2.2) is a prime candidate for Mébius inversion. (See the 
appendix for a discussion of Mébius inversion.) Applying the 
multiplicative version gives 


Qax) = [] t= 10/9) = TT] or’ — 1) 


d|n d[n 


where the Mobius function p is defined by 


1 if d=1 
p(d) = { (-1)k if d=p,po:--p, for distinct primes p, 
0 otherwise 


Note that some of the exponents p(d) may be equal to -1, and so a 
little additional algebraic manipulation may be required to obtain 
Q,,(x) as a product of polynomials. 

Finally, let us mention that, according to the definition, if v = 
q"—1, then the roots of the v-th cyclotomic polynomial Q,(x) over 
GF(q) are the primitive v-th roots of unity over GF(q). Hence, they are 
the group primitive elements of GF(q"). In other words, the monic 
irreducible factors of Q(x) are precisely the primitive polynomials of 
GF(q") over GF(q). Thus,.one way to find primitive polynomials is to 
factor this cyclotomic polynomial. 


*10.3 Normal Bases and Roots of Unity 


Recall that a normal basis for F < E is a basis for E over F that 
consists of the roots of an irreducible polynomial p(x) over F. (See 
Section 7.4.) We have seen that, in some important cases (especially 
F = Q), the cyclotomic polynomials Q_ (x) are irreducible over F, which 
leaves open the possibility that the primitive n-th roots of unity Q, 
might form a normal basis for S over F. Indeed, if Q,(x) is irreducible 
then Q,(x) = min(w,,F) and so 


deg Qa(x) = [S:F] 
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and since the roots of Q,(x) are distinct, there are the right number of 
primitive n-th roots of unity and they will form a basis for S over F if 
and only if they span S over F. It happens that Q, spans S if and only 
if n has a certain simple form. 


Theorem 10.3.1 Let F be a field with the property that Q.(x) is 
irreducible over F for all m. Then Q, is a normal basis for the 
cyclotomic extension S over F if and only if n is the product of distinct 
primes. 


Proof. We prove first that if n is a product of distinct primes then 2, is 
a (normal) basis for S over F. Let w €,. If n =p is prime then Q, = 
{w,w,...,wP1}, Since S=F(w) and min(w,F)=Q,(x) has degree 
p—1, the set {1,w,...,W?7?} is a (polynomial) basis for S over F. Since 


ope tw lS 
Ll+tw+ee+uw = O21 72 


the set 2, = {w,...,wP71} is also a basis for S over F. Hence, the result 
is true if n is prime. 

For the purposes of induction, suppose the result is true for all proper 
divisors of n and let n = km with k <n, m <n and (k,m) = 1. If wy, € 
Q, and w,, € 2, then 2, is a basis for F(w,) over F and ©, is a basis 
for F(w,,) over F. Since wu, € Qn, and (k,m) = 1, it follows that 


(Wily) = WK EO, and (Wy) ™ = WE EO, 
whence F(wp,l,) = F(Wyun) = F (Wn) WhETE Wm € Ans Hence, 
[Fey n):F (Mn) LF mn) :F] = (Fm) ?F] = [F hen) FI 
= ¢(km) = $(k)d(m) = $(k)[F(H,):F] 


and so [F(w,,W.,):F(w,,)) = ¢(k). Since Q, spans F(w,,w,,) over F(wy) 
and |Q,| =¢4(k), it follows that Q, is a basis for F(w,,w,,) over 
F(w.,)) Whence Q\., = 0, is a basis for F(w,,w,.,) over F. This 
proves that if n is the product of distinct primes, then Q, is a basis for 
F(w,). 


For the converse, let n = mp* for k > 2. Since 
k-1 
Q(x) ie Quapt(*) = Qyap(x? ) 


(an exercise) the coefficient of xP)! in Q,,(x) is 0, whence the sum of 
the roots of Q(x), that is, the sum of the primitive n-th roots of unity, 
is 0, showing that these roots are linearly dependent. Hence, they 
cannot form a basis for S over F. 
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*10.4 Wedderburn’s Theorem 


In this section, we present an important result whose proof uses the 
properties of cyclotomic polynomials. 


Theorem 10.4.1 (Wedderburn’s Theorem) If D is a finite division ring 
then D is a field. 


Proof. We begin by recalling Example 0.2.1, which describes an instance 
of the class equation. Let the group D* act on itself by conjugation. The 
stabilizer of @ € D* is the centralizer 


C*(B) = {a € D* | af = Ba} 
and the class equation is 


|D*| = 1200) + Dre 


where the sum is taken over one representative @ from each conjugacy 
class 0(8) = {aBa~!|a €G} of size greater than 1. If we assume for 
the purposes of contradiction that Z(D*) # D*, then the sum on the far 
right is not an empty sum and |C*(@)| < | D*| for some 6 € D*. 

The sets 


Z(D) = {8 €D| Ba =a for all a € D} 
and 


C(B) = {a ED | af = Ba} 


are subrings of D and, in fact, Z(D) is a commutative division ring; 
that is, a field. Moreover, Z(D)* = Z(D*) and C(£)* = C*(@) for B #0. 
Let |Z(D)| =z. Since Z(D) C C(B), we may view C(#) and D as 


vector spaces over Z(D) and so 
|C(8)| =2> and |D| =2" 
for integers 1 < b <n. The class equation now gives 


n 
—1 
es at Ds a 
be? = 


and since z> —1|2z™—1, it follows that b | n. 

If Q(x) is the n-th cyclotomic polynomial over Q, then Q,(z) 
divides 2" —1. But Q,(z) also divides each summand on the far right 
above, since for b|n, b <n we have 
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n _— 

5 = TT / T1909 
k|n j|b 

and Q,,(x) divides the right hand side. It follows that Q,(z)|z—1. On 

the other hand, 


Q,(2) = [] @-») 
EQ 


wW 


and since w € 2, implies that |z—w| > |z| — |w| =z—-1, we havea 
contradiction. Hence Z(D*) = D* and D is commutative, that is, D is a 


field. 


*10.5 Realizing Groups as Galois Groups 


A group G is said to be realizable over a field F if there is an 
extension F < E for which Gp(E) ~ G. Since any finite group of order n 
is isomorphic to a subgroup of a symmetric group S,, we have the 
following. 


Theorem 10.5.1 Let F be a field. Every finite group is realizable over 
some extension of F. 


Proof. Let G be a group of order n. Let t,,...,t, be algebraically 
independent over F and let s,,...,8, be the elementary symmetric 
functions in the t,’s. Then K = F(t,,...,t,) > F(s;,...,8,) =E is a 
Galois extension whose Galois group is isomorphic to S,. (See Theorem 
6.2.1.) We may assume that G is a subgroup of G,(K) and since G is 
closed in the Galois correspondence, it is the Galois group of F(G) < K.& 


It is a major unsolved problem to determine which finite groups are 
realizable over the rational numbers Q. We shall prove that any finite 
abelian group is realizable over Q. It is also true that for any n, the 
symmetric group S,, is realizable over Q, but we shall prove this only 
when n = p is a prime. 


Realizing Finite Abelian Groups over Q 


We shall have use for a special case of a famous theorem of Dirichlet, 
which says that if n and m are relatively prime positive integers then 
there are infinitely many prime numbers of the form nk +m. We need 
the case m = 1. First a lemma on cyclotomic polynomials. 


Lemma 10.5.2 Let p be a prime and let (n,p) = 1. Let Q,(x) be the n-th 
cyclotomic polynomial over Q and let P,(x) be the n-th cyclotomic 
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polynomial over Z,. If Q, (x) is the polynomial obtained from Q,(x) by 
taking the residue of each coefficient modulo p, then Q,(x) = P,,(x). 


Proof. If n =r is a prime then Q,(x) , P,(x) and Q,(x) are all equal to 
xr ee eo 1 


and so the result holds for n prime. Suppose the result holds for all 
proper divisors of n. Since 


x"—1= [J Qa(x) 


d|n 
taking residues modulo p gives 
x7-1= T[Qa(x) 
d|n 
over Z,. But 
x"-l= [[?a® 
d|n 


over Z, and since P(x) = Qa(x) for all d|n, d<n, we have P,(x) = 
Q,,(x). 


Theorem 10.5.3 Let n be a positive integer. Then there are infinitely 
many prime numbers of the form nk + 1, for k € zt. 


Proof. Suppose to the contrary that p,,...,p, is a complete list of all 
primes of the form nk+1. Let m=p,---p,n. Let Q(x) be the m-th 
cyclotomic polynomial over Q and consider the polynomial Q,,(mx). 
Since Q,(x) has integer coefficients, Q,,(mk) is an integer for all k € 
Z*. Since Q,,(mk) can equal 0, 1 or -1 for only a finite number of 
positive integers k, there exists a positive integer k for which 
|Q,,(mk)| >1. Let p be a prime dividing Q,,(mk). Since 
Q(x) | x™— 1, we have 


p|(mk)"—-1 
which implies that p/m, hence p # p; for i = 1,...,s. 


If P,,(x) is the m-th cyclotomic polynomial over Z, then it follows 
from the fact that p|Q,,(mk), and the previous lemma, that 


P,,(mk) = Q,,(mk) = Q,,(mk) = 0 
in Z,, where mk is the residue of mk modulo p. Thus, mk is a primitive 


m-th root of unity over Z,. In other words, mk has order m in vi and 
since the order of any element must divide the order of the group, we 
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get m|p—1. It follows that n|p—1, that is, p=nk+1, which is a 
contradiction, proving the theorem. &l 


Theorem 10.5.4 Let G be a finite abelian group. Then there exists an 
integer n and a field E such that Q < E < Q(w), where w is a primitive 
n-th root of unity, and such that Ga(E) ~G. 


Proof. By Theorem 10.2.8, the Galois group of Q(w) is isomorphic to 
Z*. Since Z* is abelian, any subgroup K of Z% is normal in Z* and so 
Q < F(K) is a Galois extension, with Galois group 


Gg(K) = Z4/K 


Thus, we need only show that any finite abelian group G is isomorphic 
to a quotient group Z*/K, for some integer n. 
Since G is finite and abelian, we have 


G ~ C(n,) x-++ x C(n,) 


where C(n;) is cyclic of degree n;. According to Theorem 10.5.3, we may 
choose distinct primes pj,...,p, of the form n,---n.k+1 and so 
n;|p;—1 for i=1,...,s. Since the cyclic group 

ra 
has order p,;—1, it has a subgroup of any order dividing p;—1, in 
particular, a subgroup K; of order (p; — 1)/n;, whence the quotient 


Z/ K; 
is cyclic of order n;, and is therefore isomorphic to C(n;). Hence, if K = 
K, x:::x K, and n = p,::-p, then 
* * * * * 
es Peg tes ee 
Oe KK aS 8 OSS CUM ~~ 
~ Ky K K, x-+:x K, K ~K 


as desired. 8 


Realizing Sp over Q 


We begin by discussing a sometimes useful tool for showing that the 
Galois group of a polynomial is a symmetric group. 

Let G be the Galois group of an irreducible polynomial f(x) € F[x], 
thought of as a group of permutations on the set R of roots of f(x). 
Then G acts transitively on R. Let us define an equivalence relation on 
R by saying that r~s if and only if either r=s or the transposition 
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(r,s) is an element of G (more properly, an element of G acts as the 
transposition (r,s) on R.) It is easy to see that this is an equivalence 
relation on R.. 

Let [r] be the equivalence class enataiaing r, and assume there exists 
s, r€R such that s#r and s~r. In other words, assume that G 
contains a transposition (r,s). If ¢ € G then o(r,s)o~! sends or to os, os 
to or and fixes all other elements of R, whence o(r,s)o~! = (or,os). 
Thus, s~r implies os ~ or and so o[r] = [or]. This shows that [r] and 
[or] have the same cardinality and since G acts transitively on R, all 
equivalence classes have the same cardinality. 

It follows that if |RJ| is prime, then there can be only one 
equivalence class, which implies that (r,s) € G for all r, s€ R. Since G 
contains every transposition, it must be the symmetric group on R. We 
have proved the following. 


Theorem 10.5.5 If f(x) € F[x] is a separable polynomial of prime degree 
p and if the Galois group G of f(x) contains a transposition, then G is 
isomorphic to the symmetric group Sp: 0 


Corollary 10.5.6 If f(x) € Q[x] is irreducible of prime degree p and if 
f(x) has precisely two nonreal roots, then the Galois group of f(x) is 
isomorphic to the symmetric group Sp: 


Proof. Complex conjugation on C is an automorphism of C leaving Q 
fixed. Since the splitting field S for f(x) over Q is Galois, conjugation is 
a Q-automorphism of S, and therefore belongs to G,(S). Since it leaves 
the p—2 real roots of f(x) fixed, it is a transposition on the roots of 
f(x). Thus, the theorem applies. § 


Example 10.5.1 Consider the polynomial f(x) =x°—5x+2, which is 
irreducible over Q by Eisenstein’s criterion. A quick sketch of the graph 
reveals that f(x) has precisely 3 real roots and so its Galois group is 
isomorphic to S;. 0 


Corollary 10.5.6 is just what we need to establish that Sp 
realizable over Q. 


Theorem 10.5.7 Let p be a prime. There exists an_ irreducible 
polynomial p(x) over Q of degree p such that p(x) has precisely two 
nonreal roots. Hence, the symmetric group 5, is realizable over Q. 


Proof. The result is easy for p = 2 and 3, so let us assume that p> 5. 
Let n be a positive integer and m>5 be an odd integer. Let 
k,,...,k,,_9 be even integers and let 
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a(x) = (x? + n)(x ~ ky)-+-(x kya) 


It is easy to see from the graph that q(x) has (m—3)/2 relative 
maxima. Moreover, if k is an odd integer, then 


Iq(k)| >2|k? +n] >2 


Let p(x) = q(x) —2. Since the relative maxima of q(x) are all greater 
than 2 and since q(—co) = —co and q(co) = 00, we deduce that p(x) has 
the same number m — 2 of real roots as q(x). 

We wish to choose a value of n for which p(x) has at least one 
nonreal root z, for then the complex conjugate Z is also a root, implying 
that p(x) has two nonreal roots and m—2 real roots. Let the roots of 
p(x) in a splitting field be a,,...,a,,. Then 


p(x) = Il (x— Q;) = (x? +n)(x— k,): +(x- k 2) ~2 


i=l 


m=1 and x™-?2 


Equating coefficients of x gives 


i=1 i=1 1<j 1<Jj 
and so 
m m m—2 
doa? =( DeaiP-2 aig =( Dow) A Do ik +n) 
i=l i=1 i<j i= i<j 
m—2 
= k? —2n 
i=l 


If n is sufficiently large, then do? is negative, whence at least one of 
the roots a; must be nonreal, as desired. 

It is left to show that p(x) is irreducible, which we do using 
Eisenstein’s criterion. Let us write 


q(x) = (x? +.n)(x —k)+\(x —k,, 9) = + ay pe te tag 


In the product (x —k,)---(x—k,,_»), each coefficient except the leading 
one is divisible by 2. Hence, we may write 


(x — ky)s++(X — Kg) = x? + 2f(x) 


Multiplying by x? +n gives 
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q(x) = x™ + 2x7£(x) + nx™? + Onf(x) 


Taking n to be even, we deduce that all nonleading coefficients of q(x) 
are even. In addition, the constant term of q(x) is divisible by 4 since 
m>5. It follows that p(x)=q(x)—2 is monic, all nonleading 
coefficients are divisible by 2, but the constant term is not divisible by 
2? = 4. Therefore p(x) is irreducible and the proof is complete. § 


Exercises 
All cyclotomic polynomials are assumed to be over fields for which they 
are defined. 


1. Prove that if x*—1=Q,(x)p(x) where p(x) € Z[x] then Q,(x) € 
Z[x]. 

2. | When is a group primitive element of the cyclotomic extension S,, 
also a primitive n-th root of unity over GF(q)? 

3. If (n,q) 4.1, how many n-th roots of unity are there over GF(q)? 

4. What is the splitting field for x*—1 over GF(3)? Find the 
primitive 4-th roots of unity in this splitting field. Do the same for 
the 8-th roots of unity over GF(3). 

5. If a,,...,@, are the n-th roots of unity over GF(q) show that 
ak +ak+..-+ak=0 forl<k<n. 

6. Show that Q,(x) € GF(q)[x] is irreducible if and only if 
0,(q) = ¢(n). 

7. If (n,q)=1, prove that x®~!}4x"-?4...+x+41 is irreducible 
over GF(q) if and only if n is prime and Q,(x) is irreducible. 


8. Show that ifr is a prime, then Q(x) = (x"” -1) /(x?-1 —1). 
9. Evaluate Q,(1). 

10. Evaluate Q,(-1). 

11. Show that Q(w,) N Q(w,,) = Q if (m,n) = 1. 


Verify the following properties of the cyclotomic polynomials. As usual, 
p is a prime number. 


12, Qup(x) = Qy(x”)/Q,(x) for pfn. 
13. Qup(*) = Q,,(x?) for all p|n. 

14, Qupt(*) = Qap() 

15. Q,(0) =1 for n> 2. 

16. Q,(xt)x*) = Q,(x) for n > 2. 


17. Ifn= pyle -pyk is the decomposition of n into a product of powers 
of distinct primes, then 
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ey- e,-1 


Q(x) = QD x on "PK ) 


On the structure of ZF. 


18. 


19. 


20. 


21. 


22. 


If n = []r; where r; = P;i are distinct prime powers then 
aK 
Zt ~ IZ. 


Assume p # 2 is prime. Let n = p*. 

i) Show that |Z*| =p*}(p—1). 

ii) Show that Z* has an element of order p— 1. 

iii) Show that 1 re p € Z* has order pol. 

iv) Show that Z* is cyclic. 

v) Ifn=2° then Z* is cyclic if and only if e = 1 or 2. 
vi) Show that ZF is cyclic if and only if n = p®, 2p® or 4. 


If n>1 then there exists an irreducible polynomial of degree n 

over Q whose Galois group is isomorphic to Z,. 

ey an integer n and a field E such that Q<E< Q(w,) with 
Go(E) = Zg. Here w,, is a primitive n-th root of unity over Q. 

Calculate the Galois group of the polynomial f(x) = x° — 4x + 2. 


More on Constructions 


The following exercises show that not all regular n-gons can be 


constructed in the plane using only a straight edge and compass. The 
reader may refer to the exercises of Chapter 2 for the relevant 
definitions. 


Definition A complex number z is constructible if its real and imaginary 
parts are both constructible. 0 


Ci. 


C2. 


C3. 


C4. 


C5. 


C6. 


Prove that the set of all constructible complex numbers forms a 
subfield of the complex numbers C. _ 

Prove that a complex number z = re? is constructible if and only 
if the real number r and the angle 0 (that is, the real number 
cos @) are constructible. 

Prove that if z is constructible, then both square roots of z are 
constructible. Hint: use the previous exercise. 

Prove that a complex number z is constructible if and only if there 
exists a tower of fields Q<F,<-+:<F,, each one a quadratic 
extension of the previous one, such that z € F,. 

Prove that if z is constructible, then [Q(z):Q] must be a power of 
2. 

Show that the constructibility of a regular n-gon is equivalent to 
the constructibility of a primitive n-th root of unity w,. Since the 
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cyclotomic polynomial Q,(x) is irreducible over the rationals, we 
have [Q(4,):Q] = deg Q,(x) _ o(n). 
Prove that ¢(n) is a power of 2 if and only if n has the form 


n= 2p: “Pm 
where p,, are distinct Fermat primes, that is, primes of the form 
Ss 
QF +1 


for some nonnegative integer s. Hint: if +1 is prime then j must 
be a power of 2. Conclude that if n does not have this form, then 
a regular n-gon is not constructible. For instance, we cannot 
construct a regular n-gon for n = 7, 11 or 90. [Gauss proved that if 
n has the above form, then a regular n-gon can be constructed. See 
Hadlock (1978).] 


Chapter 11 
Cyclic Extensions 


Continuing our discussion of binomials begun in the previous 
chapter, we will show that if a is a root of x" —u and if w is a primitive 
n-th root of unity over F, then F(w,q) is a splitting field for x" —u over 
F. Moreover, in the tower 


F < F(w) < F(w,a) 


the first step is a cyclotomic extension, which as we have seen, is 
abelian and may be cyclic. The second step is cyclic of degree d|n. 
Nevertheless, as we will see in Chapter 13, the Galois group Gp(F(w,a)) 
need not even be abelian. In studying the second step in this tower, we 
will actually characterize finite cyclic extensions, when the base field 
contains appropriate roots of unity. 

Before beginning, we remark that if F is a field of characteristic 
p #0 and if p|n, then F cannot contain a primitive n-th root of unity. 
For if n = pm and w"—1=0 then 


0=u®-1=0P™—1=(u™—1)P 
and so w™ = 1, whence w is an m-th root of unity, for m <n. Thus, 
saying that a field F contains a primitive n-th root of unity tacitly 


implies that (n,expchar(F)) = 1. (Such an implication is not made by 
saying that F contains the n-th roots of unity.) 
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11.1 Cyclic Extensions 


Let F be a field with expchar(F) =p, let u€F and let S be the 
splitting field for the binomial x"—u over F. We will assume 
throughout that (n,p) = 1 and so x" —u has n distinct roots in S. 

If a is a root of x"—u in S and w is a primitive n-th root of unity 
over F then the roots of x" —u are 


(11.1.1) a, wa,...,Ww9 le 


and so S = F(w,a). In words, all n-th roots of u can be obtained by first 
adjoining the n-th roots of unity and then adjoining any single n-th root 
of u. 

The extension F <S can thus be decomposed into a tower 


F < F(w) < F(w,a) =S 


The first step is cyclotomic. We turn to a study of the second step. 

It will simplify the notation to assume that w € F. Thus S = F(a) is 
a splitting field for x*—u and so F < F(a) is a Galois extension. Each 
o € G = G,(S) is uniquely determined by its value on a and 


oa = we 


for some k(c) € Z,. Since w € F, we have for 0, 7 €G 


(ora = o(wk Ma) = uk Mea = wk) Kg 

Hence, the map oul?) is a group monomorphism from G into U, 
and therefore G is isomorphic to a subgroup of U,. It follows that G is 
cyclic and if |G| =[F(a):F] =d then d|n. As the next theorem shows, 
this actually characterizes cyclic extensions when the base field contains 


a primitive n-th root of unity. 


Theorem 11.1.1 Let F be a field containing .a primitive n-th root of 
unity. The following are equivalent. 


1) F< Eis cyclic of degree d|n. 

2) E=F(qa) where min(a,F) = x4—v, for v € F and d|n. 

3) Eis a splitting field for an irreducible binomial x4—v, where v € 
F and d|n. 

4) E=F(qa) where a is a root of a binomial x" —u, for u€ F. 

5) Eis a splitting field for a binomial x" —u, for u € F. 


Proof. Let us first show that 2) through 5) are equivalent. Since F 
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contains a primitive d-th root of unity for any d|n, it is clear that 2) 
and 3) are equivalent, as are 4) and 5). If 2) holds then since a4 = v € 
F and x4— a4 divides x"—a", it follows that @ is a root of x"—u 
where u=a™€F. Hence 4) holds. Suppose now that 4) holds. The 
roots of x"—u are given by (11.1.1) and since the d roots of p(x) = 
min(a,F) are pmong me list (11.1.1), their predich, which lies in F, has 
the form wa’. Hence a4 € F and p(x) = x4— a4. Thus, 2) holds. 

We have already shown that 4) implies 1) so it remains to prove that 
1) implies 2). Suppose that F < E is cyclic of degree d|n, with Galois 
group 


G=(c)= {1,0,...,097}} 


Note that F contains a primitive d-th root of unity €= n/ a Now; 
ate F= F(G) if and only if oat = a4, which is equivalent to (ca)4 = 


Qa’, or 


(11.1.2) (4x)" =1 


If we can find an a € E for which pelea = , then (11.1.2) will hold, we 
will neve ave F, whence x?— a4 € F[x] and if p(x) = min(a,F) then 
p(x) | x4— 4. But the roots of p(x) are 


(11.1.3) a, oa,...,09 1a 


and since oa = é-la, we have oXa = Eka, which implies that the 
elements (11.1.3) are distinct. Hence, deg p(x)=d and so p(x) = 
x4— a4 and E = F(a), as desired. 

Thus, we are left with finding an a € E for which a/oa = €. Since 
£€F, its norm satisfies 


Ngr(é) = elBF] — eda 
The proof is then completed by taking @ = € in the following theorem. § 


Theorem 11.1.2 (Hilbert’s Theorem 90) Let F <E be a finite cyclic 
extension with Galois group G = (c). An element § € E has the form 


al 


B= 


for some a € E* if and only if its norm Np je(8) is equal to 1. 


Proof. Let [E:F] = d. Suppose that Np jr(8) = = 1. We desire an a € E for 
which B(oa) = a. Consider the maps 
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To =4 T= B(0B)(o7A)---(ok1B)ok, for0<k<d 


Then 
Tk41 = B(or,), for0<k <d-1 


Since Tg = Ngjr(S)o4 =.= Tg, the map 


which is nonzero by the Dedekind Independence Theorem, satisfies 
B(or) =7. Since r #0, there exists a nonzero y € E for which ry #0 
and so (Bo)(ry)=Ty, that is, 8 =ry/o(ry), whence a=ry is the 
desired element. We leave proof of the converse to the reader. § 


11.2 Extensions of Degree Char(F) 


There is an “additive” version of Theorem 11.1.1 which deals with 
cyclic extensions of degree equal to p=char(F) >0, where the role of 
the binomial x" — u is played by the polynomial x? — x — u. 

Suppose that F is a field of characteristic p#0. Let F<E and 
suppose that a € E is a root of the polynomial 


f(x) =xP-—x-u 


for u € F. Since the prime subfield of F is Z,, and since kP =k for any 
ke Zo the p distinct elements 


a,at+1,...,.a+p-—l 


are the roots of f(x). Unlike the previous case, we need no special 
conditions on F to insure that if an extension of F contains one root of 
f(x), it contains all the roots of f(x). Hence, F(a) is a splitting field of 
f(x). 

We have two cases to consider. If a € F then f(x) splits in F. Now 
suppose that a ¢ F. Then p(x) = min(a,F) has degree d > 1, with roots 


a, A+e),...,4+0€g 4 


where 0 < e; < p—1. The sum of these roots is da +k, for some integer 
k, and since this number lies in F but ag F, we must have d=p, 
whence f(x) = min(a,F) is irreducible. In short, f(x) either splits in F or 
is irreducible over F with splitting field F(a), for any root a of f(x). 
Since F(a) is a splitting field for the separable polynomial f(x) = 
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xP —x—u, we deduce that F < F(a) is Galois. If f(x) is irreducible over 
F and G = G;(F(q)), there exists a o € G for which oa = a +1. Since 
ofa =a +i, it follows that G = (c) = {0,0,...,0P 71} is the cyclic group 
generated by o. 


Theorem 11.2.1 (Artin—-Schreier) Let char(F) = p #0. The polynomial 
f(x) = xP —x—u either splits in F or is irreducible over F. Moreover, 
the following are equivalent. 


1) F< E is cyclic of degree p. 

2) E=F(qa) where min(a,F) = x? —x—u, forueé F. 

3) E is a splitting field for the irreducible polynomial x? —x—u, 
where u € F. 


Proof. It is clear that 2) and 3) are equivalent and we have seen that 2) 
implies 1). To prove that 1) implies 2), suppose that F < E is cyclic of 
degree p, with Galois group G = (c) = {t,0,...,0? 1}. Then a? -a EF 
if and only if 


a(aP —a) =aP-a 
or, equivalently, 
(ca—a)P =ca-a 
Hence, if we find an a€E for which ga—a=1 then aP—a€F. 


Moreover, ca =a +1 and so o'a =a +i, which implies that the roots 
of min(a,F) are the distinct values 


a,a+1,...,a+p-—l 
It follows that 
min(a,F) = xP —x—(aP —a) 
and hence that [F(a):F] =[E:F] and E = F(a). Since Trgjp(-!) =0, 


the proof is completed by taking 6 =-—-1 in the additive version of 
Hilbert’s Theorem 90 given below. & 


Theorem 11.2.2 (Hilbert’s Theorem 90, Additive Version) Let F < E be 


a finite cyclic extension with Galois group G = (c). An element @ € E 
has the form 8 = a—oa for some a € E if and only if Trg/p(P) =0. 


Proof. Let [E:F] = n and consider the map 
7 = Bo +[B+(of)]o? +--+ [8+(o8)+--+(0"-*A)]o" 


It is easy to verify that t—or =GB(tt+o4---+o"-!) and so if 
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Trp j(7) = 1 for some 7 € E (such a y must exists since F < E is finite 
and separable and so the trace map is not the zero map) then 


TY-oTY= BT rg p(y) =f 
Thus, a = Ty is the desired element. 


In this section and the previous one, we have discussed cyclic 
extensions of degree n where (n,expchar(F)) = 1 or n = p = char(F) £ 0. 
A discussion of cyclic extensions of degree n = p* for k > 1 is quite a bit 
more involved (requiring a discussion of so-called Witt vectors) and thus 
falls beyond the intended scope of this book. The interested reader may 
wish to consult the books by Karpilovsky (1989) or Lang (1993). 


Exercises 

1. Let F <E be cyclic of degree n, with Galois group G = (c). If 8 € 
E has the form S=a/oa for some 0#a@€E, show that 
Ngp(A) = 1. 

2. Let F<E be cyclic of degree p" where P is a prime. Let 
F<K<E with F<K cyclic of degree p* where d<n. Let 
F <L <E and suppose that E = KL. Show that E = L. 

3. Let char(F) =p #0 and let F(a,) = F(a) be cyclic of degree p 
over F, where min(a;,F) =xP—x—u,. Show that a) =na,+b 
where b€ F and 0 <n<p-l. 

4. Let F be a field and let E be the extension of F generated by the 
n-th roots of unity, for all n > 1. Show that F < E is abelian. 

5. Let E be a field and let o be an automorphism of E of order d. 
Suppose that @€E has the property that o@ = and p4=1. 
Prove that there exists an a € E such that ca = af. 

6. Let E be a field and let o be an automorphism of E of order d > 1. 
Show that there exists an a € E such that ca =a +1. 

7. Let F<E be finite and abelian. Show that E=F,:--F,, is the 
composite of fields F; such that F <F; is cyclic of prime power 
degree. Thus, the study of finite abelian extensions reduces to the 
study of cyclic extensions of prime power degree. 

8. Let F be a field containing the n-th roots of unity. Let F be an 
algebraic closure of F. Show that if a € F is separable over F and 
if a is a root of the binomial x"—u with u € F, then F < F(a) is 
cyclic of degree d|n. 


Chapter 12 
Solvable Extensions 


We now turn to the question of when an arbitrary polynomial 
equation p(x) = 0 is solvable by radicals. Loosely speaking, this means 
(for char(F) = 0) that we can reach the roots of p(x) by a finite process 
of adjoining n-th roots of existing elements, that is, by a finite process 
of passing from a field K to a field K(a), where a@ is a root of a 
binomial x"—u, with u@K. We begin with some basic facts about 
solvable groups. 


12.1 Solvable Groups 


Definition A normal series in a group G is a tower of subgroups 
{e} =Gg < Gy <Gy<-++<G,=G 


where G;<«G;,,. A normal series is abelian if each factor group G;,,/G; 
is abelian, and cyclic if each factor group is cyclic. 0 


Definition A group is solvable (or soluble) if it has an abelian normal 
series. 0) 


Theorem 12.1.1 The following are equivalent for a nontrivial finite 
group G. 


1)  G has an abelian normal series. 
2)  G has acyclic normal series. 
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3) G has a normal series in which each factor group G,, ,/G; is cyclic 
of prime order. 


Proof. It is clear that 3)=>2)=1). Thus, we need only prove that 1)=>3). 
Let {G;} be an abelian normal series. We wish to refine this series by 
inserting subgroups until all quotients have prime order. The 
Correspondence Theorem (Theorem 0.2.15) says that the natural 
projection 7:G;,,Gi44 /G, is a normality-preserving bijection from the 
subgroups of G;,, containing G; to the subgroups of G;,,/G;. Hence, 
by Cauchy’s Theorem, if a prime p divides o(G;,,/G;) then G,,,/G; 
has a subgroup of order p, which must have the form H;/G, for 
G, < H; < G44. 

Since G;,,/G; is abelian, H;/G;4G,,,/G;, whence H,4G;,). Since 
G,4G;,;, we also have G,<H;. Thus, G;<H;<«G,,,. Note also that 
H,/G; is abelian and, by the Third Isomorphism Theorem, 


.../G: 
Gj44/H; = Sat 

1 1 
is the quotient of an abelian group and is therefore also abelian. 

Thus, we have refined the original abelian normal series by 
introducing H;, where H;/G; has prime order. Since G is a finite group, 
we may continue the refinement process until we have an abelian 
normal series, each of whose quotient groups has prime order. § 


The next theorem gives some basic properties of solvable groups. The 
proofs of all but statement 2) can be found in standard texts on group 
theory. 


Theorem 12.1.2 

1) Any finitely generated abelian group is solvable. 

2) (Feit-Thompson) Any finite group of odd order is solvable. 

3) Any subgroup of a solvable group is solvable. 

4) If H<G then G is solvable if and only if H and G/H are solvable. 

5) Any homomorphic image of a solvable group is solvable. 

6) The direct product of a finite number of solvable groups is 
solvable. 

7) The symmetric group S,, is solvable if and only ifn < 4. 0 


12.2 Solvable Extensions 


Although the upcoming results can be proved in the context of 
arbitrary finite extensions, we shall restrict our attention to separable 
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extensions. As the reader knows, this produces no loss of generality for 
fields of characteristic 0 or finite fields. 


Definition A finite separable extension F < E is solvable if there exists a 
field S for which F < E<S, where F <S is Galois and has a solvable 
Galois group G,(S). 0 


Theorem 12.2.1 


1) If F <E is solvable, then there exists a field S such that F< E<S 
where F < § is finite, Galois and solvable. 

2) A finite Galois extension F <E is solvable if and only if the 
Galois group Gp(E) is solvable. 

3) If F < E is solvable and E”° is the normal closure of E over F then 
F < E”* is solvable. 


Proof. Let F < E be solvable and let S be the field mentioned in the 
definition. Since F<S is normal, we have F<E<E™<S. By 
Theorem 4.5.2, the separability of F < E implies that F < E"° is Galois. 
Moreover, 


is solvable and so F < E”°. This proves part 3). Theorem 2.9.6 implies 
that if F <E is finite then so is F <E™° and so part 1) is proved. 
Finally, if F < E is finite, Galois and solvable then E™° = E and part 3) 
implies part 2). The converse is obvious. &f 


In view of part 1) of the previous theorem, we may always assume 
that the field S in the definition of solvable is a finite extension of F. 


Theorem 12.2.2 The class of solvable extensions is distinguished. 


Proof. Suppose first that F < E is solvable and F < K is arbitrary. Then 
there exists a field S such that F < E<S with F <§ finite, Galois and 
G,(S) solvable. Hence, K < SK is finite and Galois. Since G,(SK) is 
isomorphic to Gy ,s5(S), which is a subgroup of G,(S), it too is 
solvable. Hence K < EX is solvable. 

Suppose now that F <E is solvable and F<K<FE. Hence, there 
exists an S such that F< K<E<S where F <S is finite and Galois 
and G,(S) is solvable. It follows that K < S$ is Galois and since G;(S) is 
a subgroup of G,(S), the former is solvable, whence K < E is solvable. 
It is evident that F < K is solvable. 

Suppose now that F< K<E with F <K and K <E solvable and 
consider Figure 12.2.1. 
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Figure 12.2.1 


Since F < K is solvable there exists S, such that F< K<S, where 
F < Sy is finite, Galois and Gp(S,) is solvable. Similarly, since K < E 
is solvable, there exists a field Sp such that K < E < Sp where K < Sp is 
finite, Galois and Gy(S,) is solvable. Since K < E is solvable, the lifting 
Sx <S,E is solvable and so there exists a field T such that 
Sk <S,E < T where Sx < T is finite, Galois and 


Gs (T) 
is solvable. 
If F < T was normal, our problems would be quickly solvable, but it 
need not be. Thus, we turn to the normal closure T"° of T over F. Since 


F <T is finite and separable, it follows that F <T"° is finite and 
Galois. Recall that T"° = VoT, for all o € Hom,(T,T) and since 


| Hom,(T,T) | = [T:F], 


is finite, this composite is a finite one. For each o € Hom,(T,T), the 
map o:T—oT is an F-automorphism. The normality of F < S, implies 
that oS, =S, and since Sy <T is Galois and Gs, (T) solvable, it 
follows that Sy, < oT is Galois and Gs, (oT) is solvable. 

According to Theorem 5.5.3, the extension S, <T"° is Galois and 
Gg_ (T°) is isomorphic to a subgroup of the product [] Gg (oT) and 
since this is a finite product, it is solvable. Finally, since 


G,(T"*) 
Ge(Sk) ~ Gop 
K 
and both Gp(Sx) and Gg (T°) are solvable, so is Gp(T"°), whence 
F < E is solvable. & . 
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12.3 Solvability by Radicals 


Loosely speaking, when char(F) = 0, an extension F < E is solvable 
by radicals if it is possible to reach E from F by adjoining a finite 
sequence of n-th roots of existing elements. More specifically, we have 
the following definitions, which also deal with the case where 


char(F) # 0. 


Definition Let expchar(F) = p and let F < R. A radical series for F < R 
is a tower of fields 


F=R)<R,<---<R,=R 


such that each step R; < R;,, is one of the following types: 
Type 1: R,,, =R,(G;) where ; is an 1,-th root of unity. 


i 
Type 2: Ri,, =R,(a;) where a; is a root of xi-u,, with 14 u, ER; 


and (r;,p) = 1. 


Type 8: (For p>1 only) Ri,;=R,(a;) where a, is a root of 
xP —x—u,, with u; € R;. 


For steps of types 1 and 2, the number 1; is the exponent of the step. 
The exponent of a type 3 step is p. 0 


Note that if expchar(F) = p #1 and @ is an r-th root of unity where 
r= mp* and (m,p) = 1 then @ is also an m-th root of unity. Hence, we 
may assume that in a type 1 extension, the exponent r; is relatively 
prime to the characteristic p. 

Note also that lifting a radical series gives another radical series with 
the same type steps, for if R,,, = R,(a), where a is a root of f(x) € 
R,[x], then 


KR; 41 = (KR;)(@) 
where @ is a root of f(x) € (KR,)[x]. 


Definition A radical extension is a finite separable extension F < R that 
has a radical series. A finite separable extension F < E is solvable by 
radicals if there exists a radical extension F < R containing E, that is, 
F<E<R.0 


Theorem 12.3.1 The class of extensions that are solvable by radicals is 
distinguished. If F < E is solvable by radicals then so is F < E®° where 
E™< is the normal closure of E over F. 
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Proof. Let F < E be solvable by radicals, with associated radical series 
{R,}. Thus, F< E<R. Let F< K be any extension. Lifting the series 
by K gives a radical series {KR,} from K to KR containing KE, whence 
K < KE is solvable by radicals. 

Now let F< K < E with F< K and K < E solvable by radicals. Let 
{R,} be the radical series for F <R containing K and let {S;} be a 
radical series for K < S containing E. We lift the series {S;} by R to get 
a radical series {RS;} for RK < RS containing RE. Since RK = K, the 
series {R,}, followed by the series {RS;}, is a radical series for F < RS 
containing EK. Thus, F < EK is solvable by radicals. 

If F <K <E and F < E is solvable by radicals then a fortiori F < K 
is solvable by radicals. If {R;} is a radical series for F < R containing E 
then {KR,} is a radical series for K < KR containing KE = E, whence 
K < E is solvable by radicals. 

For the last statement, let F <E<R where F < R is radical. Let 
oe Hom,(E,E). Since E<R is algebraic, we may extend o to TE 
Hom,(R,E). Since ¢:R-@(R) is an F-isomorphism if {R,} is a radical 
series for F < R then {@R;} is a radical series for F < GR containing cE. 
Hence, F < cE is also solvable by radicals. Since E"° = VcE is a finite 
composite, it follows that F < E"° is solvable by radicals. § 


Now we come to the key result that links the concepts of solvable 
extension and solvability by radicals. 


Theorem 12.3.2 A finite separable extension F <E is solvable by 
radicals if and only if it is solvable. 


Proof. Suppose first that F < E is solvable. Let S be a field for which 
F<E<S where F<S is finite, Galois and G= G,(S) is solvable. 
Thus, there is a normal series decomposition 


(12.3.1) {e} =G)<G,<G,<-<G,=G 


where G,<G;,, and G;,,/G; is cyclic of prime order 1, dividing |G|. 
Taking fixed fields gives 


(12.3.2) F=F(G)<FG,_,)<:---< F(Go) < F({e}) =$ 


Unfortunately, since the appropriate roots of unity may not lie in these 
fields, we cannot apply the relevant theorems (11.1.1 and 11.2.1) of the 
previous chapter to conclude that this is a radical series. Hence, we first 
adjoin the necessary r,-th roots of unity. 

If G; < G,,, is a step in the series (12.3.1) then the corresponding 
step in (12.3.2) has prime degree 
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r = (F(G;):F(Gi44)] 
dividing [S:F]. So let 
[S:F] = n = mp* 


where p = expchar(F) and (m,p) = 1 and let w be a primitive m-th root 
of unity. If we show that F(w) < S(w) is solvable by radicals then since 
F < F(w) is a type 1 extension, it follows that F < S(w) is solvable by 
radicals and therefore so is F <S. Since F(w) < S(w) is a lifting of the 
finite, solvable Galois extension F <S by F(w), it is also finite, solvable 
and Galois. Note also that [S(w):F(w)]|(S:F] and so if r#p is any 
prime dividing [S(w):F(w)], then r|m and so F(w) contains a primitive 
r-th root of unity. 

Thus, the extension F(w) < S(w) is finite, Galois and solvable and 
F(w) contains a primitive r-th root of unity for any prime r # p that 
divides [S(w):F(w)]. We need to show that F(w) < S(w) is solvable by 
radicals. In view of this, we may as well assume to begin with that F 
contains a primitive r-th root of unity for any prime r#p dividing 
[S:F]. 

Referring to Equation (12.3.2), consider the Galois correspondence on 
the finite Galois extension F(G,,,) <5. Since F(G;) is an intermediate 
field and G,;<G;,,, Theorem 5.4.1 implies that F(G,,,) < F(G;) is 
Galois and 


Gia, , IG) ~ Gj41/G; 


which is cyclic of prime order r,; dividing [S:F]. To simplify the 
notation, let 


F(Gi41) =L, F(G;))=M and r,=r 


Then G,(M) is cyclic of prime order r dividing mp*. 

If r=p, Theorem 11.2.1 implies that there exists an a € M for which 
M = L(a), where a is a root of xP -x—u for some u€ L. Thus, L<M 
is an extension of type 3. If r#p then r|m and so L contains a 
primitive r-th root of unity. Theorem 11.1.1 then implies that M = 
.L(a), where @ is a root of x"—u for some u€ L. Hence, L <M is an 
extension of type 2. Thus, each step in the tower (12.3.2) is of type 2 or 
type 3 and we conclude that F <S is solvable by radicals, as desired. 

For the converse, suppose that F < E is solvable by radicals. Then 
F < E™ is Galois and solvable by radicals. Let 


F=R)<R,<-::<R,=R 
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be a radical series for F<R containing E"°. We wish to adjoin 
appropriate roots of unity, lifting the series to one in which each step is 
cyclic. Then, by tacking on a front end, we get a series with cyclic steps 
that begins with F and goes past E. 

Let r be the least common multiple of all of the exponents in the 
series {R;} and let w be a primitive r-th root of unity. If R; < R,(a;) is a 
step of type 1, then a; is an r,-th root of unity where r,;|r and so 
R,(a;,w) = R,(w). Hence, lifting {R,} to {R,(w)} eliminates all steps of 
type 1. (We remove any trivial steps of degree 1.) 

If R; < R,(q;) is a step of type 2, then a; is a root of x it u; and 
since R,(w) contains a primitive r,-th root of unity, Theorem 11.1.1 
implies that R,(w) <R,(w, a) is cyclic. Finally, Theorem 11.2.1 
guarantees that if R; < R,,, is of type 3, then R,(w) < R;,4(w) is cyclic. 

Thus, each step in the tower 


F < F(w) = Ro(w) < +--+ < Ry(w) = R(w) 


is abelian, all steps after the first one being cyclic. Taking Galois groups 
gives a series 


(12.3.3) 
{e} = Gayy(R()) < Gp (uy(R()) < +++ < GRi(uy(R(w)) < Gp(R(Y)) 
Since R; < R;,, is normal, so is R,(w) < R,,,(w) and so 
Gri, (wR) 4 GRi.)(R)) 
and the quotient group is 


Gr (uy(R(4)) 
GR & mes Bisa) 


Ri 44 (+) 
which is abelian. Thus, Equation (12.3.3) is an abelian normal series for 
G,(R(w)) and so F < R(w) is solvable. Hence, F < R is solvable. & 
12.4 Polynomial Equations 
The initial motivating force behind Galois theory was the solution of 


polynomial equations f(x) =0. Perhaps the crowning achievement of 
Galois theory is the statement, often phrased as follows: there is no 
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formula, similar to the quadratic formula, for solving polynomial 
equations of degree 5 or greater over Q. However, this is not the whole 
story. The fact is that, for some polynomial equations, there is a 
formula and for others there is not and, moreover, we can tell by 
looking at the Galois group of the polynomial whether or not there is 
such a formula. In fact, there are even algorithms for solving 
polynomial equations when they are “solvable,” but these algorithms 
are unfortunately not practical. 

Let us restrict attention to fields of characteristic 0. We refer to the 
four basic arithmetic operations (addition, subtraction, multiplication 
and division) and the taking of n-th roots as the five basic operations. 

Let C be a field. We will say that an element a € C is obtainable by 
formula from C if we can obtain @ by applying a finite sequence of any 
of the five basic operations, to a finite set of elements from C. 

Suppose we can obtain any element from the field K by formula from 
C. Applying any of the four basic arithmetic operations to the elements 
of K gets us nothing new. However, taking an n-th root of an element 
a € K gives us access to all elements of L = K(./a), since any element 
of L is a polynomial in fa over K. Hence, repeated use of the five basic 
operations allows us to obtain any element lying within any finite tower 
of the form 


(12.4.1) C=Fy)<F,<Fy<::'<F, 


where F;,, = F;(a;), with a; a root of a binomial xi- u, over F;. Since 
we are assuming that char(F) =0, the tower (12.4.1) is just a radical 
series for C < F,. Hence, we can obtain by formula any element in any 
radical extension C < R of C. 

On the other hand, let a € R where Equation (12.4.1) is a radical 
series for C<R. Then w€F,=F,_,(0;), where a;=\/i,_,, with 
u;_; € F;_,. Since @ is a polynomial in a; over F;_,, it follows that a 
can be obtained by formula from F;_,. It is now clear that any element 
of R can be obtained by formula from C. 


Theorem 12.4.1 Let C be a field of characteristic 0. An element a € C 
can be obtained by formula from C if and only if a@ lies in a radical 
extension of C, that is, if and only if C < C(a) is solvable by radicals. 0 

Let us say that a root a of a polynomial f(x) = ag +a,x+-:: +a4x? 
over F is obtainable by formula if we can obtain a by formula from 
C = Q(ap,...,4q) Thus, a root a of f(x) is obtainable by formula if and 
only if C <C(a) is solvable by radicals. Theorems 12.3.1 and 12.3.2 
now imply the following. 
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Theorem 12.4.2 Let char(F) =0 and let f(x) = ap tayx+e+++agxt be 

a polynomial over F. 

1) The roots of f(x) are obtainable by formula if and only if the 
extension C <S is solvable, where C = Q(ag,...,ag) and S is the 
splitting field for f(x) over C. ° 

2) Let f(x) be irreducible over F. One root of f(x) is obtainable by 
formula if and only if all roots of f(x) are obtainable by formula. 0 . 


According to Theorem 10.5.7, for any prime number p, there exists a 
polynomial f(x) of degree p over Q whose Galois group is isomorphic 
to S_. Hence f,(x) is irreducible and since the group S,, is not solvable 
for p>5, Theorem 12.4.2 implies that if p > 5, then none of the roots 
of f,(x) can be obtained by formula. Although it is much harder to 
show, this also holds for any positive integer n [see Hadlock, 1987]. 
Thus, we have the following. 


Theorem 12.4.3 For any n> 5, there is an irreducible polynomial of 
degree n over Q none of whose roots are obtainable by formula. 0 


As a consequence, for any n > 5, there is no formula, similar to the 
quadratic formula, for the roots of any polynomial of degree n. More 
specifically, we have 


Corollary 12.4.4 Let n> 5 and consider the generic polynomial p(x) = 
Yoty,X+--++y,x", where yo,...,y, are algebraically independent over 
Q. Then there is no algebraic formula, involving only the five basic 
operations, the elements of Q and the variables yo,...,y,, with the 
property that, for any polynomial f(x) = aj + a,x +---+a,x” of degree 
n over F, we can get a root of f(x) by replacing y; in the formula by a,, 
for alli =0,...,n.0 


Exercises 

1. Prove that if H«G then G is solvable if and only if H and G/H 
are solvable. 

2. Prove that if F <E is solvable by radicals and o € Hom,(E,E) 
then F < cE is also solvable by radicals. 

3. Calculate the Galois group of the polynomial f(x) = x>— 4x +2, Is 
there a formula for the roots? 

4. Prove that if f(x) is a polynomial of degree n over F with Galois 
group isomorphic to S,, then f(x) is irreducible over F. 

5. A finite separable extension F < E of characteristic p is solvable 
by radicals if and only if there exists a finite extension F < R with 
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F <E<R and a radical series {R,} for F < R in which each step 
R; < Rj,, is one of the following types: (1) R,,, = R,(w;) where w; 
is an x;-th root of unity with r, prime and 1; # p. (2) Ri, , = R,(a;) 
where a; is a root of x"—u, with u € R,, r prime and r # p. (3) (If 
p>0 only) Ri,, =R,(4;) where §; is a root of the irreducible 
polynomial xP —x—u, with u € R;. 

6. Prove Theorem 12.4.2. Hint: for part 2), consider the normal 
closure of C(a), where a is an obtainable root of f(x). 

7. Let f(x) be an irreducible cubic over Q with three real roots. Show 
that no root of f(x) can be obtained by formula if we allow only 
real n-th roots. (That is, no root of f(x) is contained in a radical 
series whose fields are subfields of R.) Hint: Use the fact that the 
splitting field for f(x) over Q is given by Q(./A,r), where r is a 
root of f(x) and A is the discriminant. 


Chapter 13 
Binomials 


We continue our study of binomials by determining conditions that 
characterize irreducibility and describing the Galois group of a binomial 
x"—u in terms of 2x2 matrices over Z,. We then consider an 
application of binomials to determining the irrationality of linear 
combinations of radicals. Specifically, we prove that if p,,...,p,, are 
distinct prime numbers, then the degree of 


Q(YPir- ++ / Pm) 


over Q is as large as possible, namely, n™. This implies that the set of 


all products of the form 
4 ps) 7 ps?) ies qy pe) 


where 0 < e(i) <n—1, is linearly independent over Q. For instance, the 
numbers 


1, Y3H=V38, 9/4 = 9/24 and 8/72 = W/230320 


are of this form, where p, = 2, py =3. Hence, any expression of the 


form 
a3 + ayv/4 + ay\/72 


where a; € Q, must be irrational, unless a; = 0 for all i. 
First, a bit of notation. If uéF, then u!/” stands for a particular 
(fixed) root of x" —u. The set of primitive n-th roots of unity is denoted 
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by Q,, and uw, always denotes a primitive k-th root of unity. 


13.1 Irreducibility 


Let us first recall a few facts about the norm. Let F < E be finite 
with a € E. If the minimal polynomial of a 


min(a,F) = xd 4 ag_ xt} +++ ag 
has roots r,,...,Tg then 


d 
N(a) = [] x = (-1)%ap 


i=1 


where N = Np, a)/F: Note that N(a) € F. Also, for all @ € F(a) and a€ 
F, we have 


1) N(#")=N(G)", na positive integer, 
2) N(aB) = a“N(8), 
3) N(a) =a. 


We begin with Lemma 4.7.6, restated here for convenience. 


Lemma 13.1.1 If char(F)=p#0 and ueéF, ugF? then xP _u is 
irreducible for every k > 1. 


Proof. If 2 is a root of f(x) = x"_ u then, in a splitting field, 
f(x) = (x— A) 


Since p(x) = min(@,F) divides f(x), we have p(x) = (x— py for some 
d <k. Since the constant term ge of p(x) lies in F, ifd <k—1 we get 


us ph = (gp ERP 
contrary to assumption. Hence d = k and f(x) = p(x) is irreducible. § 
We turn next to primes different from char(F). 
Lemma 13.1.2 Let p be a prime different from char(F). If ue F, u ¢ FP 


then xP —u is irreducible over F. Thus, x? —u is irreducible over F if 
and only if it has no roots in F. 


Proof. Assume that u¢FP and let a be a root of xP—u with 
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F(a):F] = d < p. Since a? = u, applying the norm N = N gives 
F(a)/F 
[N(a@)]P = N(aP) = N(u) = ut 


Letting N(a) =v EF gives v? = u4. If p<d then (d,p) =1 and there 
exist integers a and b for which ad + bp = 1. Hence 


u= yadt+bp = u24y>P e y8PybP = (v2ubyP € FP 


a contradiction. Thus p=d and xP —u = min(a,F) is irreducible. The 
second statement follows from the first. 


For p #2, the previous result (and its proof) extends more or less 
directly to prime powers p*, that is, if u¢ FP then 
k 
xP —u 
is irreducible over F. However, the case p = 2 is not quite as simple. 
Since for any nonzero b € Q, we have —4b* ¢ Q? but 
x4™ 4 4b4 = (x?™ 4 2bx™ +4 2b?)(x2™ — 2bx™ + 2b?) 


is reducible for all m >1, we must at least include the restriction (for 
4| p*) that u cannot have the form -4b‘ for any b €F, that is, u¢ 
-4F*, It turns out that no further restrictions are needed. 


Lemma 13.1.3 Let p be a prime, k a positive integer and uc F. Ifu¢ 
FP and if u ¢ -4F4 when 4 | p*, then 


k 
f(x) =xP -u 
is irreducible over F. 


Proof. If p = char(F), the result follows from Lemma 13.1.1, so assume 
that p # char(F). We proceed by induction on k. Lemma 13.1.2 shows 
that the result is true for k = 1 and hence that xP — u is irreducible over 
F. Assume the result is true for any positive integer less than k > 2. Let 
B be a root of f(x). In a splitting field, we have 


xP —u = (x—ay)(x — ag)-+-(x — @,) 
Hence , 


k k-1 k-1 k-1 
f(x) = xP -u=(xP = —ay)(xP — —ag)e+-(xP i) 


Thus @ is a root of one of the binomial factors, say 
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k-1 
g(x)=xP  -@ 


k-1 
where a = a; for some i. Since a= 8P _, we have the tower 
F < F(a) < F(A) 


where [F(a):F] = p. If g(x) is irreducible over F(a), it will follow that 
(F(8):F(a)] = pk"! and so [F(@):F] = p*, whence f(x) = min(,F), 
which is irreducible. We must now consider a few cases. 


Case 1: p#2. To show that g(x) is irreducible over F(a), we verify 
that a ¢ F(a)?. Suppose to the contrary that a = 7? € F(a)? for some 
7 €F(q@). Since min(a,F)=x?—u, applying the norm N= Ne(a)/F 
gives 


—u = (-1)PN(a) = (-1)PN(7?) = (-1)PIN()P 


Since p is odd, we get u = [N(7)]? € FP, contrary to assumption. Hence 
a ¢ F(q)?P, g(x) is irreducible over F(a) and f(x) is irreducible over F. 


Case 2: p=2. If ag F(a)? and a¢-4F(a)‘, then the induction 
hypothesis shows that g(x) is irreducible over F(a), so we need to 
consider two subcases. 


Case 2a: p = 2, a = 7” € F(a)? for some 7 € F(a). 
We show directly that f(x) is irreducible over F. If N = Nay a)/F then 
since min(a,F) = x? —u, the usual norm computation gives 


-u = (-1)?N(a) = N(7?) = [N(7)]? 


Setting N(y) =b€F gives -u = b* € F?. Since u ¢ F?, we get -1 ¢ F?. 
In other words, i ¢ F, where i is a root of x? + 1. Over F(i), we have the 
factorization 


k k k— k~-1 
(13.1.1) f(x) =x? -u=x? +b? = (x? : + ib)(x?- — db) 


If both of the factors on the right side are irreducible over F(i), then 
f(x) cannot factor nontrivially over F. For if f(x) = []a,(x) is a 
nontrivial factorization, where the a,(x) are irreducible over F, then one 
of the factors has degree at most 2*-!, and is not one of the factors in 
(18.1.1). Factoring each a;(x) into irreducibles over F(i) would then 
produce a prime factorization over F(7) distinct from (13.1.1), which is 
not possible since F[x] is a unique factorization domain. 

Now, if one of the factors in (13.1.1) is reducible, the induction 
hypothesis implies that one of ib or -%b lies in F(i)? or -4F(i)* = 
[2iF(i)”]?. In either case, one of ib or -ib is in F(i)*, say 
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+ ib =(c+di)? =c? + 2cdi-d? 


Thus, c? = d? and b? = 4c?d? = 4c’. It follows that u = -b? = —4c4, a 
contradiction to the hypothesis of the lemma. Thus, f(x) is irreducible 
over F. 


Case 2b: p = 2, a ¢ F(a)” but a = -4y4, for some 7 € F(a). 
Since a has degree 2 over F, taking norms gives 


-u=N(a)= N(-474) = 16[N(y)]* 


and so -u € F’. Hence, -u =a? for a€F and so -1 =a?/u=a’/a’ € 
F(a)’, say -1 = i”, with i € F(a). Then 


a = 474 = (2iy*)* € F(a)? 
a contradiction. Hence, this case cannot occur. fl 
Now we can prove the main result of this section. 
Theorem 13.1.5 Let n > 2 be an integer and let u € F. The following are 


equivalent. 


1) f(x) = x" —u is irreducible over F. 
2) uéFP? for all primes p|n and u ¢ -4F* when 4|n. 


In particular, if 4/n, then x"—u is irreducible over F if and only if 
xP — u is irreducible over F, for all primes p | n. 


Proof. The last statement follows from Lemma 13.1.2. Proof of 1)=>2) is 
left to the reader. For the converse, we have seen that this result holds 
if n = p* is a prime power. Suppose that n = p*m where (p,m) = 1 and 
k > 1. We may assume that p is odd, for if 2 is the only prime divisor 
of n then n = 2* is a prime power. We proceed by induction on n. Let 3 
be a root of x" —u. In a splitting field, we have 


x™—u = (x —a,)(X — dy):+-(x — a) 
Thus 


k k k k 
f(x) =x®-u=x™P —u=(xP —a,)(xP —ay):+(xP —a,) 


Suppose that f is a root of 


k 
g(x) =xP -a 


where a = a; for some i. By induction, x™— u is irreducible over F and 
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so the first step in the tower 
F < F(a) < F(f) 


has degree m. af g(x) is irreducible over F(a), then the second step will 
have degree p*, whence [F(6):F] = mp* =n and f(x) = min(@,F), which 
is irreducible. 

We apply the inductive hypothesis to show that g(x) is irreducible. 
Since p is odd, we need only show that a ¢ F(a)?. If a= 7? for some 
7 € F(a) then taking norms N = Ne(a)/F 8ives 


—u = (-1)™N(a) = (-1)"N (7?) = (-1)™ING)IP 


If m is odd, we get u = [N(y)] € F?, a contradiction. If m is even then 
since p is odd, we have u = [-N(7)]? € FP, again a contradiction. Hence, 
a ¢ F(a)P, g(x) is irreducible over F(a) and f(x) is irreducible over F. 


13.2 The Galois Group of a Binomial 


Let us now examine the Galois group of a binomial x" — u over F, for 
u #0 and n relatively prime to expchar(F). If a is a root of x"—u and 
w €Q,, then the roots of x"—u are a, wa,...,w"-la and so S = F(w,a) 
is a splitting field for x" —u over F. Moreover, in the tower 


(13.2.1) F < F(w) < F(w,a) = 


the first step is a cyclotomic extension, which is abelian since its Galois 
group is isomorphic to a subgrou up of & The second step is cyclic of 
degree d|n with min(a,F(w)) =x*?—a4 . Nevertheless, the Galois group 
G,(S) need not be abelian. 

The fact that a and w both satisfy simple polynomials over F is the 
key to describing the Galois group G,(S). Since any o € G,(S) must 
permute the roots of x" —u, there exists an integer k(o) € Z, for which 


oa =wklg 


Moreover, since F(w) is a normal extension of F, the restriction of o to 
F(w) is in Gp(F(w)) and therefore o sends w to another primitive n-th 
root of unity, that is, 


ow = wil?) 


where j(a) € Z*. 
Multiplication in G,(S) has the following form. For a, rt € Gp(S), 
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Sti o(uk(7)q) = slo)k(7) UK(o), whlo)k(r) +k(o) 
and 


orw = owl") = yiloilr) 


There is something reminiscent of matrix multiplication in this. Indeed, 
let Ab, be the set of all matrices of the form 


1 0 
Ab, = f >: keZ jez 
k j 
Since 
1 0 1 0 1 0 
ee Oe ae ae ee 


we see that Ab, is a subgroup of the general linear group GL,(Z,) of all 
nonsingular 2x2 matrices over Z,. (All entries are taken modulo n.) 
Comparing this product with the action of the product or shows that 
the map ¥:Gp(S)—Ab,, defined by 


1 0 


EN Cait, 


satisfies 
vor) = ¥(c)¥(7) 


and is, in fact, a monomorphism from G,(S) into Ab,. 
Since | A, | =nd¢(n), where ¢ is the Euler phi-function, the map » 
is surjective if and only if 


[S:F] = | Gp(S)| =n¢(n) 
But in the tower 
F < F(w) < F(w,a) =S 
we always have [F(w):F] <¢(n) and [F(w,a):F(w)] <n. (See Figure 


13.2.1.) Hence % is surjective (and an isomorphism) if and only if 
equality holds in these two inequalities. 
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Figure 13.2.1 


Theorem 13.2.1 Let n be a positive integer relatively prime to 
expchar(F). Let S be the splitting field for x*—u over F, where u € F, 
u #0. Let a be a root of x°—u and wE,. Then G,(S) is isomorphic 
to a subgroup of the group Ab, described above, via the monomorphism 


1 0 


aa One 


where ca =u*(q and ow =w(%), In addition, w is an isomorphism 
and G,(S) ~ AL, if and only if both steps in the tower (13.2.1) have 


n 
maximum degree, that is, if and only if both of the following hold 


1) [F(w):F] = ¢(9), 
2)  [F(w,a):F(w)] =n, that is, x" —u is irreducible over F(w). 0 


Statement 2) is phrased in terms of F(w) and we would prefer a 
statement involving only the base field F. For n prime, this is easy. 


Lemma 13.2.2 Let p be a prime and let wE2,. Then x?—u is 
irreducible over F(w) if and only if it is irreducible over F. Equivalently, 
xP — u has a root in F(w) if and only if it has a root in F. 


Proof. Certainly, if x" —u is irreducible over F(w), it is also irreducible 
over F. For the converse, consider the tower 


F < F(w) < F(u,a) 
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Since xP — u is irreducible over F, we have 
= [F(a):F] < [F(w,):F] 


On the other hand, the first step in the tower has degree at most 
¢(p) = p—1 and the second step is cyclic of degree d|p, whence d = 1 
or p. Hence (F(w,a):F(w)] = p, which implies that x? — u = min(a,F(w)) 
is irreducible over F(w). 


In order to extend this result to arbitrary n (and for its own 
interest), we want to say more about when the Galois group G,(S) is 
abelian. Of course, since both steps in the tower 


F < F(w) < F(w,a) =S 


are abelian, if either step is trivial, then G,(S) is abelian. Thus, if w € F 
or if a € F(w) then Gp(S) is abelian. The converse is also true when n is 
prime. 


Lemma 13.2.3 Let p be a prime and let w € 0. Then the Galois group 
G,(S) is abelian if and only if at least one step in the tower (13. 2.1) is 
trivial, that is, if and only if either w € F or xP —u has a root in F(w) 
[or, equivalently, a root in F]. 


Proof. One direction has already been discussed so we need only show 
that if w¢F and xP—u is irreducible over F(w) then Gp(S) is not 
abelian. Since w ¢ F, it has a conjugate wi # w that is also not in F. Let 
T € Gp(F(w)) be defined by rw = wi, Since xP—u is irreducible over 
F(w), for each i€ Z,, the map r may be extended to a o; € Gp(S) 
defined by 


7) = ll 
ow=w, oa=wa 
Taking i = 1 and i’ = 0 gives 
and 
09010 = Oo(wa) = Wa 


and these are distinct since w ~ wl, Hence,.o, and gy do not commute 
and G,(S) is not abelian. & 


We can now strengthen the statement of Theorem 13.2.1 by showing 
that, in certain cases, when n is odd and [F(w):F] = ¢(n), then x"—u is 


236 13 Binomials 


irreducible over F(w) if and only if it is irreducible over F. The idea of 
the proof is this. Suppose that p(x) is an irreducible polynomial over F, 
with splitting field S. Suppose also that E is a normal extension of F. 
Then p(x) has a root in E if and only if it splits in E, that is, if and 
only if F<S<E. Now, if F <E is an abelian extension, that is, if 
G,(E) is abelian, then so is any quotient group of G,(E), in particular, 
so is 


Gp(E) 
Gg(E) 


Thus, if G,(S) is not abelian, we can conclude that p(x) does not have 
a root in E. 

Part of the hypotheses of the next theorem is that the base field F 
does not contain any n-th roots of unity, other than 1. Note that this is 
equivalent to saying that F does not contain any primitive p-th roots of 
unity for any prime p|n. 


Gp(S) _ 


Theorem 13.2.4 Let n be an odd positive integer relatively prime to 
expchar(F). Let w be a primitive n-th root of unity over F and suppose 
that F does not contain a primitive p-th root of unity for any prime 
p|n. Let F <A be any abelian extension. Then x"—u is irreducible 
over F if and only if it is irreducible over A. 


Proof. Clearly, if x" —u is irreducible over A, it is also irreducible over 
the smaller field F. Suppose that x"—u is irreducible over F, but not 
over A. Since 4f/n, Theorem 13.1.5 and Lemma 13.1.2 imply that there 
exists a prime p|n for which no roots of xP —u lie in F, but some root 
a of xP—u lies in A. Hence, Lemma 13.2.3 implies that if € is a 
primitive p-th root of unity, then the Galois group Gp(F(€,a)) is not 
abelian. 

On the other hand, since F < A is normal and A contains one root of 
the irreducible polynomial x? — u, it contains all roots of xP — u. Thus, 


F<F(E,a)<A 
But F < A is abelian and therefore so is the quotient 


Gp(F(E,a)) =a 


This contradiction implies that x" — u is irreducible over A. 


According to Theorem 10.2.9, if [F(w):F] = ¢(n), then F cannot 
contain any primitive p-th roots of unity for any p|n and we may 
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apply Theorem 13.2.4 to get the following strengthening of Theorem 
13.2.1. 


Corollary 13.2.5 Referring to Theorem 13.2.1, if n is an odd positive 
integer relatively prime to expchar(F) then Gp(S) ~ Ab, if and only if 
[{F(w):F] = ¢(n) and x” — u is irreducible over F. 0 


Since [Q(w):Q] = ¢(n), we have 


Corollary 13.2.6 Referring to Theorem 13.2.1, if F = Q and n is an odd 
positive integer then GQ(S) ~ At, if and only if x"—u is irreducible 
over Q. 0 


n 


Thus, when F < F(w) has the largest possible degree ¢(n) (which 
includes the important case F = Q), we see that G,(S) ~ A, if and 
only if x"—u is irreducible over F. In some sence, Ab, is the “most 
nonabelian” subgroup of AL,. At the opposite extreme, we can show, 
again when [F(w):F] = ¢(n), that Gp,(S) is abelian if and only if x"—u 
actually has a root in F. 


Theorem 13.2.7 Let n be an odd positive integer relatively prime to 
expchar(F). Let S be the splitting field for x*—u over F, where u€ F, 
u #0. Suppose that [F(w):F] = ¢(n) where w €2,. Then the following 
are equivalent. 

1) G,(S) is abelian 

2) x™—Uu has a root in F 

3) x™—Uu has a root in F(w) [and therefore splits in F(w)] 


Proof. Clearly, 2)=>3)=>1). Suppose that Gp(S) is abelian and let k be 
the largest divisor of n for which u € F*, that is, u = f* for some f € F. 
The proof will be complete if we show that k =n. If k <n, let p be a 
prime number dividing n/k. Consider the tower 


(13.2.2) F < F(w,) < F(wp,f"/?) 


Note that xP —f is irreducible over F, for if not, then f=g? € FP for 
some g@€F, whence u= f* = gPk eFPk in contradiction to the 
definition of k. Hence [F(f!/?):F] = p and 


[F(w,,f/?):F] > p 
Theorem 10.2.9 implies that [F(w,):F])=p-—1 and __ since 
F(w,) < F(w,,f/ P) is cyclic of degree dividing the prime p, neither step 
in the tower (13.2.2) is trivial. Hence, Lemma 13.2.3 implies that the 
Galois group H = Gp(F(wp sf! P)) is not abelian. 
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We will now produce a contradiction by showing that G,(S) abelian 
implies H is abelian. Since each root of x? —f is a root of xP*—u we 
have 


F< F(w,,f'/) < F(wpjyu/Pk) 


Since (u!/™)/PK is a root of xPK—u, at least one root of xP*—u is in 
F(w,u!/*), But w/Pk = Wp, and so all roots of xPk _ y are in F(w,u!/?). 
Hence, 


F< F(wp,f'/?) < F(wpyt'/PK) < F(w,u!/*) 


Since F < F(w,ul/ ™) is assumed to be abelian, so is the subextension 
F< F(wp,f/ P), that is, H is abelian. This contradiction completes the 
proof that 1) implies 2). § 


In the exercises, we ask the reader to provide a simple example to 
show that Theorems 13.2.4 and 13.2.7 fail to hold when n is even. 

We conclude this section by generalizing the previous theorem, in 
order to characterize precisely (for n odd) when G,(S) is abelian. The 
proof follows lines similar to the proof of Theorem 13.2.7, but is a bit 
more intricate and since it involves no new insights, the reader may 
wish to skip it on first reading. However, the result is of interest since it 
shows how the relationship between the n-th roots of unity and the 
ground field F play a role in the commutativity of Gp(S). We first need 
a result that is of interest in its own right. The proof is left as an 
exercise. 


Theorem 13.2.8 Let x" —a and x" —b be irreducible over F and suppose 
that F contains a primitive n-th root of unity. Then x"—a and x"—b 
have the same splitting field over F if and only if b = c"a‘ for some c € 
F and r relatively prime to n. 0 


Theorem 13.2.9 Let n be an odd positive integer relatively prime to 
expchar(F). Let U,, be the group of n-th roots of unity over F and let 
U,, = U, NF". If S is the splitting field for x"—u (ueEF, u#0), then 
G,(S) is abelian if and only if u™ € F”. 


Proof. Note first that m|n since U,, is a subgroup of U,. Moreover, 
since U,, = (w,,) is cyclic, w; € F if and only if i|m. Suppose first that 
u™ = {” for some f € F. Then 


y/n = wk fi/m 


1/n 


for some integer k. (More precisely, given any n-th root u/" of n and 
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any m-th root f!/™ of f, there exists a k such that this equation holds.) 

The field F(f/ ™) is cyclic over F, since the latter contains a primitive 

m-th root of unity w,,. Therefore, since F < F(w,,,) and F < F(fl/ ) 

are both abelian, so is the extension 
F< F(Wyyn)F(P/™) = Fant!) = Fart!) 

Finally, since F< S< F(w,,,00/ ), we deduce from Theorem 5.5.5 that 

F <S is abelian. 

For the converse, assurne that G,(S) is abelian. Let k be the largest 
positive integer such that m|k, k|n and u™€ FX, say u™ = f* for fe 
F. (There is such an integer since k = m satisfies these conditions.) We 
need to show that k =n. Suppose to the contrary that k <n and let p 
be a prime number dividing n/k. Let p® be the largest power of p such 
that p*|m. (The hypothesis that n is odd and [F(w):F] = ¢(n) in 
Theorem 13.2.7 implies that m = 1, whence s = 0.) 

The first step is to show that the extension 


s+1 
F<F(w sy!” ) 


is abelian. It is clear that the notation is a bit unwieldy, so let us set 
q=p*t! and note that q|n since p$|m|[k and p|(n/k). To see that 
this extension is abelian, we embed it in an abelian extension. Since 


(ft/a)ka —-fKiy™= (um/kayka 
we have f!/4 = wh gu! kq for some j and so 


F(waf/9) < F(Wqpf/9) = F(wyequ™/*9) 
If we set 
a (ul/nypm/ka 


then v is a root of x*V/™_u and ve F(wy, ju!/"), Hence, all roots of 


xa/™ _ y are contained in F(wyqyu'! ”), that is, 
F(wyq su kay < F(wyqou'! ”) 
Putting the pieces together gives 
F< F(waf'/9) < F(w,q0™/*9) < F(wq/™) < F(wa,)F(w,su!/*) 


Since F< F(wy,) and F < F(w,,u'/*) are abelian (the latter by 
assumption), the composite 
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F< F(wgJF(w yu!) 
is abelian and therefore so is 
F < F(waf'/) 
We now propose to arrive at a contradiction by considering the tower 
F < F(wg) < F(wgsf!/9) 


Note that xP-f is irreducible over F, since otherwise f = g? € FP for 
some g€F, whence u= fk = gpk €FPk in contradiction to the 
definition of k. 

We first take the case s = 0, whence q = p. Since xP —f is irreducible 
over F, we have (F(f!/?):F] = p and 


[F(w,,f'/P):F] > p 


Since p{'m, it follows that w, ¢ F and so the extension F < F(wp) j is not 
trivial. Since [F(w,): FJ< <p-1 and F(w,) < F(w,,f 1/P) is cyclic of 
degree dividing the prime p, the latter extension” is also not a 
Hence, Lemma 13.2.3 implies that the Galois group H = Gp(F(w, ift/ Ph) 
is not abelian, the desired contradiction. 

Now assume that s>0. With regard to the first step in the tower, 
letting r= p* > p, we have r|m and q{m, hence w, € F but w, ¢ Un. 
Since s > 0, we also get w, € F. Hence xP—w, is either irreducible or 
splits in F. But w, is a root not in F and so xP —w, is irreducible over 
F. (Note that for s > 0, the first step in the tower has degree p, rather 
than a number dividing p — 1, hence we cannot use the same strategy as 
when s = 0.) Since the roots of xP —w, are 


w Wp Way ees WP hw, 
for each jE Z,, there is a 0; € Gp(F(w,)) for which ojw, = wh, To 
show that Gp F(w, f/9)) is not abelian, we shall need only os Oo: war 
and oy :w gw, 

There : are fee possibilities for the second step in the tower. If x4—f 


is irreducible over F(w,) then we can extend og and a, to elements of 
Gp(F(waf'/9)) by defining 


q 


3 fi/a, 1/q 
Ty, 1WgrWgs orf wef 


and fail 
Fy giWgt Wg, 71 o:f a i 
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Then 


1/q_ ‘/a_,, pi/a 
%1%1,0f = orf = Wf 


and 


oroit t= 01 (Wat! 2) = ww, fl! a 


which are distinct since w, # 1. Hence, Gp(F (waft! 4)) is not abelian, a 
contradiction. 

If x1—f is reducible over F(w,) then f€ F(w,)?. Thus f= 6? for 
some # € F(w,) and so F(f) < F(w,). Since xP —w, and xP —f are oe 
irreducible over F, it follows that [F(w,):F]=p and [F(8):F] = 
whence F(w,) = F(8). Thus, x? —f and xP w, have the same splitting 
field over F aad Theorem 13.2.8 implies that 


f= wi.yP 
for some v € F. Taking k-th powers gives, since r | k, 


m _. k Iki yep = ykP 


for v € F, which contradicts the definition of k. Thus, k =n and the 
theorem is proved. #f 


*13.3 The Independence of Irrational Numbers 


A familiar argument (at least for p = 2) shows that if p is a prime 
number then ,/p ¢ Q and so [Q(,/p):Q] = 2. Our plan in this section is 
to extend this result to more than one prime p and to n-th roots for 
n> 2. Since the case when n is even involves some rather intricate 
details which give no further insight into the issues involved, we will 
confine our attention to n odd. (The case n = 2 is straightforward and 
we invite the reader to supply a proof of Theorem 13.3.2 for this case.) 
If a>0 is rational, the notations Ja and a/" will denote the real 
positive n-th root of a. The results of this section were first proved by 
Bescovitch [1940] but the method of proof we employ follows more 
closely that of Richards [1974]. 


Lemma 13.3.1 Let u = a/b be a positive rational number, expressed in 
lowest terms, that is, where (a,b) = 1. If n > 2 is an integer then 


ey 5 €Q if and only ifa=c™ and b=d" for some integers c and d 
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In particular, if p is a prime, then VP ¢ Q. 


Proof. One direction is quite obvious. Suppose then that 
¢yn a 
(= 


where c and d are positive integers and (c,d) = 1. Then ad™ = bc™ and 
since (a,b) = 1, it follows that a|c™. Thus c" = aa for some integer a. 
Substituting this into ad" = bc™ gives ad" = aab or d" = ab. But since 
(c,d) = 1, we also have (c",d") = 1, that is, (aa,ab) = 1. Hence a=1 
and soa=c"™ and b=d". 8 


Suppose now that n is odd. Since p ¢ Q’ for any prime r|n, Theorem 
13.1.5 implies that x" —p is irreducible over Q and so [Q(/P):Q] =n. 
Let us generalize this to more than one prime. 


Theorem 13.3.2 Let n >2 be an integer and let p,,...,p,, be distinct 
primes. Then 


[Q 1 Bis ‘ \/Pm):Q) =n 


Proof. As mentioned earlier, we confine our proof to the case where 
n > 3 is odd. Let w €Q,. Since 


[Q(e)(2/B js» 7m): Q(e)) $ [OCB +s Pm ):Q] < n™ 


it is sufficient to show that 


[Q(%)(Y/Pis--+1 /Pm):Q(w)] = n™ 


which we shall do by induction on m. 

Let p be a prime. Since x"—p is irreducible over Q and Q contains 
no primitive r-th roots of unity for any prime r|n, Theorem 13.2.4 
implies that x" — p is also irreducible over Q(w). Hence, 


[Q(v,/P):Q(u)] =n 


and the theorem holds for m = 1. 
Now let us suppose that the theorem is true for the integer m and let 
p be a prime distinct from the distinct primes p,,...,p,,- Let 


F =Q(w) and E = Q(w)(Y/Ppy-+-s4/Pm) 


If x"—p is not irreducible over E then there exists a prime r|n such 
that p! ™€E. Thus, p!/" is a linear combination, over Q(w), of terms of 
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the form 


where 0 < e(i) <n—1. There are two cases to consider. 


Case 1: If the linear combination involves only one term, then 


where c € Q(w) and not all e(i) are 0. If n=rd, this can be written in 


the form 
"| pe Q(w) 
yoy E QW 
po). ; pelm) 


This says that the radicand q is a positive rational number and the 
polynomial x"—q has a root in Q(w). According to Theorem 13.2.7, 
x" —q must also have a root in Q, which is not possible since q does not 
have the form a"/b”, for integers a, b. Hence, this case cannot occur. 


Case 2: At least two terms in the linear combination are nonzero. It 
follows that one of the primes p,, which we may assume for convenience 
is p,,, appears to different powers in at least two distinct terms. 
Collecting terms that: involve like powers of p,, gives 


(13.3.1) p'/¥ = Ag+ Apa" + Appa” +++ + Ag pt)" 


where A; € Q(w)(Y/Pis +++ 4/Pm—a) and where at least two of the A,’s 


are nonzero. Now, since 


Qu) < Uw) p---s Pm) 


is a Galois extension (this is why we adjoined w in the first place), the 
inductive hypothesis implies that its Galois group G has size n™. Since 
any o €G must send roots of x" —p; to other roots, it must send + Pj 
to w Pi for some j = 0,...,n —1. Since there are n™ such choices, all of 
these choices must occur. 

Thus, there is a o € G for which 


opll™ = wpl/™,  opi/™ = pl/™ (for all i < m) 


Since op*/™ — ukp4/™ for some 0<k <n—1, applying o to (13.3.1) 
gives 


wkpa/n = Ag tAwphl™ + Agw*p2/P 4... AL wt tpa-D/n 
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We now multiply (13.3.1) by wk and subtract the previous equation to 
get 

0 = (wk 1)Ag + (W*—w Appa? +--+ (uk — wtp /n 
where at least one of the coefficients (wk — wi)A, is nonzero. This is a 


contradiction to the inductive hypothesis. We have therefore established 
that x" — p is irreducible over E and the proof is complete. § 


Exercises 

1. Let n be relatively prime to char(F). Show that the Galois group 
of x"—u is cian aus toa subgroup of the group generated by o, 
r where o" = r® 1, ora~* = 7". What is r? 

2. (Van der Waerden) Let n be relatively prime to char(F). Show 
that the Galois group of x"—u is isomorphic to the group of 
linear substitutions modulo n: x—cx +d where d € Z,, c € ZF. 

3. Let x"—u € GF(q)|[x]. Show that the following are equivalent: (i) 
r|n, r prime implies u ¢ GF(q)* and (ii) r|n, r prime implies 
r| o(u) but r{(q—1)/o(u) where o(u) is the multiplicative order of 
u in GF(q). 

4. Prove Lemma 13.1.2 by factoring x? —u in a splitting field and 
then considering min(a,F). 

5. Prove the following without using any of the results of Section 
13.1. If u€ F and (m,n) = 1 then x™" —u is irreducible over F if 
and only if x™—u and x" —u are irreducible over F. 

6. Let char(F) =p #0 and let F <E be cyclic of degree p*, with 
Galois group G = (c). If there exists a 6 € E with Trp j(8) =1 
show that there exists an a € E for which the polynomial f(x) = 
xP — x — aq is irreducible over E. 

7. Let char(F) = p>0 and let n = p°m where (m,p) = 1. Show that 
the Galois groups of = 

x™—u and x™—uP 
are the same. 

8. Let n be a positive integer relatively prime to expchar(F) and let 
w be a primitive n-th root of suny over F. Let S = F(w, ul ”) be 
the splitting field for f(x) =x"—u over F, where u€ F, u £0. If 
4|n and if u te ¢ F then G,(S) is not abelian. 

9. Show that Theorem 13.2.4 and Theorem 13.2.7 fail to hold when 
n is even. Hint: /2€ Q(w), where w is a primitive 8-th root of 
unity. 

10. Prove the following: Let f(x) be a monic irreducible polynomial of 
degree m over F, with constant term -ap. Let n > 2 be an integer 
with the following properties (i) (m,n) = 1, (ii) 4J/n (iii) ag ¢ F” 
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11. 


12. 


13. 
14. 


15. 


for all primes r|n. Then the polynomial f(x") is also irreducible 

over F. 

Let w be a primitive n-th root of unity over F, n odd, and let a be 

a root of x"—u over F. Then S = F(w,q) is the splitting field for 

x"—u. Assume that Gp(S) ~ Ab,. (See Theorem 13.2.1.) In this 

exercise, we determine the largest abelian subextension F” of S. 

a) If G is a group, the subgroup G’ generated by all 
commutators aBa-1g-1, for a, BEG, is called the 
commutator subgroup. Show that G’ is the smallest subgroup 
of G for which G/G’ is abelian. 

b) If the commutator subgroup G,(E)’ of a Galois group Gp(E) 
is closed, that is, if Gp(E)’ = G,(E) for some F < K < E, then 
K is the largest abelian extension of F contained in E. 

c) The commutator subgroup of Ab, is 


‘ 1 0 
Ab, = k 1 :keZ, 
and 
, 1 0 . 
(Gp (S)) = 0(Gp(S)) N A,’ = fa ad :1e€Zy 
d 


where d = [F(w,a):F(w)]. 

d) G,(S)' = Gp w)(S); and so F(w) is the largest abelian 
extension of f contained in F(w,q). 

Prove that if p,,...,p,, are distinct primes then 


(1) [QC /Par-++1y/Pm):Q] = 2 


by induction on m. 

Show that \/60 ¢ Q(+/42,,/10). 

Let n be a positive integer relatively prime to expchar(F) and let 
w be a primitive n-th root of unity over F. Let S = F(w,u!/") be 
the splitting field for f(x) =x "—u over F, where uEF, u# 0. If 
for some prime p|n, we have w, ¢ F and u!/P ¢ F, where w, isa 
primitive p-th root of unity over F, then the Galois group G,(S) 
is not abelian. 

Let x"—a and x"—b be irreducible over F and suppose that F 
contains a primitive n-th root of unity. Then x"—a and x"—b 
have the same splitting field over F if and only if b=c"a™ for 
some c € F andr relatively prime to n. Hint: if the splitting fields 
are the same, consider how the common Galois group acts on a 
root of each binomial. 
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16. 


17. 
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Let F <E be a finite Galois extension and let a, GEE have 

degrees m and n over F, respectively. Suppose that [F(a,@):F] = 

mn. 

1) Show that if a; is a conjugate of a and (7, is a conjugate of ( 

then there is a o € Gp(E) such that oa =a; and of = f;. 

Hence, the conjugates of a + @ are a; + fi. 

2) Show that if the difference of two conjugates of a is never 
equal to the difference of two conjugates of @ then F(a,8) = 
F(a + £). 

Let r be a prime different from char(F). Let f(x) =x"—u and 

g(x) =x"—v be irreducible over F, with roots @ and #, 

respectively. Use the previous problem to show that if 

[F(a,):F] =r? then F(a,8) = F(a +t f). 


Chapter 14 
Families of Binomials 


In this chapter, we look briefly at families of binomials and their 
splitting fields and Galois groups. We have seen that when the base 
field F contains a primitive n-th root of unity, cyclic extensions of 
degree d|n correspond to splitting fields of a single binomial x”—u. 
More generally, we will see that abelian extensions of exponent n 
correspond to splitting fields of families of binomials. We will also 
address the issue of when two families of binomials have the same 
splitting field. 


14.1 The Splitting Field 


Let F be a field containing a primitive n-th root of unity and 
consider a family ¥ of binomials given by 


F = {x"-ul|ueU} 


where U CF is the set of constant terms. We will refer to n as the 
exponent of the family F. 

If S,, is the splitting field for x*"—u, then S= V {S,|u € U} is the 
splitting field for the family ¥. Since each extension F < S,, is Galois, so 
is F<S and Theorem 5.5.3 implies that G,(S) is isomorphic to a 
subgroup of the product 


H= II G,(S,,) 


uéUu 
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Since each F <S,, is cyclic of degree dividing n, the group H is the 
direct product of cyclic groups of order dividing n and is therefore 
abelian with exponent n. (Recall that a group G has ezponent n if a" = 
1 for all a€G.) Hence, G,(S) is abelian with exponent n. An abelian 
extension F <S whose Galois group Gp(S) has exponent n will be 
referred to as an abelian extension with exponent n. 

Thus, if F contains a primitive n-th root of unity, the splitting field 
of any family of binomials over F of exponent n is an abelian extension 
of F with exponent n. Happily, the converse is also true. 

Suppose that F < E is an abelian extension with exponent n. Let K 
be any field for which F < K < E where F < K is finite. Since F < E is 
abelian, so is F < K. In addition, Gp(K) is finite and has exponent n. 
Since a finite abelian group is a direct product of cyclic subgroups, we 
have 


Gp(K) ~ G, x-+-xG,, 


where each G; is cyclic with exponent n and hence order n; | n. Corollary 
5.5.4 implies that K is a composite K = K,---K,, where Gp(K;) ~ G; is 
cyclic of order n,;|n. Since F contains the n,th roots of unity and 
F < K; is cyclic, Theorem 11.1.1 implies that K; = F(q;) is the splitting 
field for 


min(a;,F) = x'i—ayi 


where a; € E. Hence K = F(ay,...,@,,) is the splitting field over F for 
the family 


Fy, = {x"i-a, |i=1,...,m} 


It follows that E is the splitting field for the union J) Fy, taken over all 
finite intermediate fields K. 


Theorem 14.1.1 Let F be a field containing a primitive n-th root of 
unity. An extension F < E is abelian with exponent n if and only if E is 
the splitting field for a family of binomials over F of exponent n. 0 


Definition Let F be a field containing a primitive n-th root of unity. An 
extension F <E is a Kummer extension of exponent n if F<E is 
abelian and has exponent n. 0 


Thus, according to Theorem 14.1.1, the Kummer extensions of F of 
exponent n are precisely the splitting fields over F of families of 
binomials of exponent n. 
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14.2 Kummer Theory 


While each family of binomials gives rise to a unique Kummer 
extension, different families may produce the same extension, that is, 
different families may have the same splitting field. We seek a 
collection of families of binomials such that there is a one-to-one 
correspondence between families in the collection and Kummer 
extensions. 

Let us phrase the problem a little differently, for which we require 
some notation. Recall that if u € F, then by u!/™ we mean a particular 
(fixed) root of x"—u. If ACF, we let A‘/ denote the set of all n-th 
roots of all elements of A. Also, if A C F and n is a nonnegative integer 
then A” = {a™|a€ A}. 

Let F be a field containing a primitive n-th root of unity. Of course, 
we may identify a family ¥ = {x"—b|b € U} of binomials of (fixed) 
exponent n with the set U C F* of constant terms (since binomials with 
zero constant term are not very interesting, we exclude such binomials). 
Moreover, the splitting field for ¥ is S = F(U ue *). 

In seeking a bijective povepoudence between subsets UC F* and 
splitting fields S = F(UY/ ™), it is natural to restrict attention to 
maximal sets U g, F* that generate the given splitting field. As we now 
show, if S = F(U!/") for some U C F*, then 


S= =F (uFey 


where (U,F*") is the subgroup of F* generated by U and F*". To see 
this, note that if u,,...,u, €U and fe F* then for some integer j, we 
have 


(fuft---ugk)!/? = wi f(ul/n)°1...(ud/)°k € F(UI/) 
and so we get nothing new in F(U’ ”) by adjoining any element of 


(U,F*") = {fufl..-ugk | fe F*, u, € U} 
That is to say, 
F((U,F*")!/") _ F(U!/") 


It follows that, as far as splitting fields for families of binomials of 
exponent n are concerned, we may restrict attention to sets of constant 
terms that are subgroups of F* containing F*". Indeed, we will show 
that if U,, is the class of all subgroups U of F* containing F*” then the 
association UHF(U!/*) is a bijection onto the class %, of all Kummer 
extensions E of F with exponent n. We will also obtain a description of 
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the Galois group G of F < E in terms of U. 

Let F < F(U ie ™) be a Kummer extension with Galois group G, and 
let o€G and u€ U. If a is a root of x"—u then ca is also a root of 
x" —u and so 


for some n-th root of unity w, 4. If @ is another root of x"—u, then 
B=wua where wEM, and so G(B/a) = = o(w!) =u! = B/a. It follows 
that 

op 


==" = Ww 


We B= Bp a o,a 


Hence, w, = Wea depends only on o. 
It follows that the map (,):G x U-U,, defined by 


(o,u) =w, = 22, for any a with a® =u 


is well-defined (does not depend on a) and we may write 


oui/n 
yin 


(14.2.1) (o,u) = 
without ambiguity. Moreover, if a" = u and b” = v then for o, 7 EG, 


(or) = 8 = Ee = ww, = ES = (o,u)(r,u) 


and 


o(48) _ oo = (0,u)(o,v) 


(o,uv) = “ap a 


Thus, for each o €G, the map ¥,:U-U,, defined by Yu = (0,u) is a 
group homomorphism and for each u € U, the map 0,:G->U,, defined 
by 9,0 =(¢,u) is also a group homomorphism. This prompts a 
discussion of the following notions. 


Dual Groups and Pairings 


If A and B are groups, we denote by Hom(A,B) the set of all group 
homomorphisms from A to B. Note that Hom(A,B) is a group under 
the product 


(¥8)(a) = (va)(8a) 


with identity being the constant map p~a = 1 for all a€ A. Using this 
notation, we can state with regard to the pairing (14.2.1), that ~, € 
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Hom(U,U,), for all o € G and 0, € Hom(G,U,), for all u € U. 


Lemma 14.2.1 
1) If A, B and C are abelian groups then 


Hom(A x B,C) ~ Hom(A,C) x Hom(B,C) 
2) If A is a finite abelian group of exponent n, then Hom(A,U,) ~ A. 
Hence, |Hom(A,U,) | = |A|. 


Proof. We leave it as an exercise to show that the map 


$:Hom(A,C) x Hom(B,C)—-Hom(A x B,C) 
defined by 
P(p,8)(a,8) = p(a)6(B) 


is an isomorphism, proving part 1). For part 2), since A can be written 
as the product of finite cyclic groups, part 1) implies that we need only 
show that Hom(A,U,,) ~ A when A = (q) is cyclic. If A has order m |n, 
then y € Hom(A,U,) maps A into U,,, since for any a € A we have 


(Ya)™ = Y(a™) = pl =1 


Suppose that U,, = (w) and let ~ € Hom(A,U,) be defined by (a) = w. 
Then 


(vd) = {,0, v?,...,y™7} 


is a cyclic subgroup of Hom(A,U,) of order m= |U,,|. Since every 
element of Hom(A,U,) is uniquely determined by its value on a, we 
deduce that Hom(A,U,)=(¥) is cyclic of order m, whence 
Hom(A,U,) ~ A. ll 


Definition If A, B and C are abelian groups, a pairing of A x B into C is 
a map (,):A x BC that is a “bihomomorphism”, that is, 


1) For each a€ A, the map ¥,:B—C defined by ¥,(8) = (a,8) is a 
group homomorphism. 
2) For each 6 €B, the map 94:A—C defined by 64(a) = (a,6) is a 


group homomorphism. 0) 


A pairing is the analog of a bilinear map between vector spaces. Note 
that (1,8) =(a,1)=1 for all a€ A and BEB and that (a,6)"!= 
(a-1,8) = (a,B7"). If SC A and T CB, we set 
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(S,T) = {(s,t) |s ES, t € T} 


(We will write ({a},T) as (a,T) and (S,{@}) as (S,f).) The left kernel of 


a pairing is the set 

Ky, = {a € A | (a,B) = {1}} 
and the right kernel is defined similarly 

Kp = {6 € B| (A,8) = {1}} 


It is easy to see that these kernels are subgroups of their respective 
parent groups. 

Note that (a,,2) = (a 9,8) for all @ €B if and only if (a,a71,B) = 
{1}, that is, if and only if a,az! € Ky, or equivalently, a,K;, = a,Ky. 
Similar statements holds for the right kernel. Thus, we may define a 
pairing from A/K,, x B/Kp to C by 


(aK; ,GKp) = (a, 6) 


and this pairing is nonsingular, that is, both the left and right kernels 
are trivial. 


Theorem 14.2.2 Let (,):A x B—U,, be a nonsingular pairing from abelian 
groups A and B into U,. Then A and B both have exponent n. 
Moreover, A is finite if and only if B is finite, in which case 


1) A~Hom(B,U,) and B ~ Hom(A,U,), 
2) |A| = |BI. 


Proof. First observe that if a € A then (a",f) = (a,@)" = 1 for all GE 
B, and so a" € K,, whence a" =1 and A has exponent n. A similar 
statement holds for B. Now consider the map A—Hom(B,U,) defined 
by ar y,, where ~,:6r9(a,). Since 


a8) = (a0',8) = (a,8)(a',8) = by(8)0,(8) 


the map ary, is a group homomorphism from A to Hom(B,U,). If 
%q = 1 is the constant homomorphism then (a,3)=1 for all 6 €B, 
that is, a € Kp, whence a = 1. Hence, the map ary, is injective. 

It follows from Lemma 14.2.1 that if B is finite, then 


|A| < |Hom(B,U,)| = |B] 
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The dual argument shows that |B| < |A| and so |A| = |B]. This 
also implies that the monomorphism ay, is an isomorphism. & 


Back to Binomials 


We resume our study of the pairing (,):G x U-U,, defined by 


oul/n 
(o,u) = “t/a 


Since the identity is the only map in G that fixes every root of every 
binomial in the family, the left kernel of this pairing is 


Ky = {0 €G|oul/* =u for all u € U} = {ce} 


An element u € U is in the right kernel if and only if 


oul/n = y/n 
for all o EG, that is, if and only if u!/"€ F(G)* = F*. Since ul/” ¢ F* 
if and only if u € F*", we have Kp = F*". 
It follows that the pairing (,):G x (U/F*")-U,, given by 
owl/n 


(o,uF*") = an 


is nonsingular. We may thus apply Theorem 14.2.2. 


Theorem 14.2.3 Ee F be a field containing a primitive n-th root of 
unity. If E = F(U'/) then the pairing 


(,):Gp(E) x U/F*"U,, 


given by 
oul/2 
“7a 


(o,uF*") = 


is nonsingular and so U/F*" has exponent n and | G;(E)| = [E:F] is 
finite if and only if (U:F*") is finite, in which case 


[E:F] = (U:F*") 
and . 
Gp(E) ~ Hom(U/F*",U,) a] 
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Theorem 14.3.1 not only describes the Galois group, of a Kummer 
extension, but allows us to show that the map Ur-F(U / ”), from U,, to 
%,,, is a bijection. 


Theorem 14.2.4 Let F be a field containing a primitive n-th root of 
unity. Let %, be the class of all Kummer extensions F < E of F with 
exponent n and let U,, be the class of all subgroups U of F* containing 
F*", Then the map UHF(UY ”) is a bijection from U,, onto %, with 
inverse given by Er+E*"N F* 


Proof. To show that the map in question is injective, suppose that 
F(U!/") = F(V/), with U, VE ,. If u€ U, then u’/" € F(V'/*) and 
so there exists a finite subset Vg of V for which ue F(vi/n), Let 
V, =(Vo,F*") be the subgroup generated by Vg and F*". Then 


vi/n Cc vile Cc vila 
and 


w/ € F(Vg!") C F(V}/") 


Note that V, EU, is finitely generated (by Vg) over F*" and hence 
(V,:F*") is finite. Theorem 14.2.3 implies that 


[F(V}/"):F] = (VF) 


Let us now adjoin u. Let V. = (u,V,) be the subgroup generated by u 
and V,. Then V, € ‘U,, and 


F(V}/") = F((u,V,)/") = F((a,V}/")) = F(V}/*) 
Another application of Theorem 14.2.3 gives 
(Vo:F"") = (V,:F*") 


and since V, C V2 we get V, = Vp. It follows that u € V, C V and since 
u was arbitrary, UC V. A symmetric argument gives V C U, whence 
U =V. This proves that the map UHnF(UY ”) is injective. We have seen 
that any Kummer extension F < E in %, is a splitting field extension 
for a family ¥ of binomials with exponent n. If C is the set of constant 
terms and if U is the subgroup of F* generated by C and F*" then E = 
F (ul ™) and so the map is surjective. 

Let F <E be a Kummer extension with exponent n and let U = 
E*"MF*. Then U is a subgroup of F* containing F*", that is, U € U,. 
It is clear that EC F(UY ). For the reverse inclusion, let 6" € U. Then 
B" = a™ for some a € k*, which implies that @ is a root of x"—a"€ 
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F[x] and so 8=wa€E*. This shows that Ul/" C E* and so E= 
F(U!/®), Hence, E5U = E*" 1 F* is the inverse map. § 


Exercises 


1. 


Referring to Lemma 14.2.1, show that the map 


%:Hom(A,C) x Hom(B,C)—Hom(A x B,C) 
defined by 
P(p,0)(a,8) = $(a)0(B) 


is an isomorphism. 

Let A be a finite group and let ~€Hom(G,U,). Show that 

ac av(a) = [Al if ofa) =1 for allac A and D0, (a) =0 

otherwise. 

Let A be a finite abelian group with exponent n. If a € A satisfies 

(a) = 1 for all » € Hom(A,U,) then a = 1. 

Let B be a proper subgroup of a finite abelian group A and let 

a€A-—B. Then there exists ~ € Hom(A,U,) such that ¥(B) = 

{1} but ¥(a) #1. 

Let A be a finite abelian group and let B be a subgroup of A. Let 

B={peE Hom(A, U,) | ¥(B) = {1}}. Show that Hom(B,U,) ~ 

Hom(A,U,,)/B*. 

Let B be a subgroup of a finite abelian group A. Let En ={PeE 

Hom(A,U,,) | ¥(B) = {1}}. Show that Hom(A/B,U,) ~ 

Let $= {f,(x)} be a family of binomials with ia £,(x) =n. 

Suppose that n;|n for all i and let F contain a primitive n-th root 

of unity. Show that there is a family of binomials, each of which 

has degree n, with the same splitting field as F. 

In this exercise, we develop the analogous theory for families of 

polynomials of the form ¥ = {xP — x —u;} where p = char(F) # 0. 

1) Prove that F < E is abelian with exponent p if and only if E is 
the splitting field of a family of the form F. 

2) Let P:F4F be the map Pa=aP-—a. Let P1U={aeF 
such that Pae€U}. Let U be the class of all additive 
subgroups of F with ®~!F CU. Let &, be the class of all 
abelian extensions F <E of F with exponent p. Prove the 
following theorem: The map UrsF(S7 'U) is a bijection 
between U and &. If F< E= F(971U) is in §, has Galois 
group G then there is a well-defined pairing e ‘\: Gx U-U, 
given by (o,a) = of — 8 for any 6 € $-ly. The left kernel is 
{1} and the right kernel is PF. The extension F < E is finite if 
and only if (U:9F) is finite, in which case [E:F] = (U:F) and 
G ~(U/PF)~. 


Appendix 
Mobius Inversion 


MGbius inversion is a method for inverting certain types of sums. 
The classical form of Mébius inversion was originally developed 
independently by P. Hall and L. Weisner in 1935. However, in 1964, 
Gian-Carlo Rota generalized the classical form to apply to a much 
wider range of situations. To describe the concept in its fullest 
generality, we require some facts about partially ordered sets. 


PARTIALLY ORDERED SETS 


Definition A partial order on a nonempty set P is a binary relation, 
denoted by < and read “less than or equal to,” with the following 
properties. 


1) (reflexivity) For all ae P, 
a<a 


2)  (antisymmetry) For all a,b € P, 
a<bandb<a implies a=b 
3) (transitivity) For all a,b,c € P, 
a<b and b<c implies a<c O 


Definition A partially ordered set is a nonempty set P, together with a 
partial order < defined on P. The expression a <b is read “a is less 
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than or equal to b.” If a,b € P, we denote the fact that a is not less 
than or equal to b by a¢b. Also, we denote the fact that a <b, but 
afb, bya<b. 

If there exists an element z € P for which z < x for all x € P, we call 
z a zero element and denote it by 0. Similarly, if there exists an element 
y €P for which x < y for all x € P, then we call y a one and denote it 
by 1.0 


As is customary, when the partial order < is understood, we will use 
the phrase “let P be a partially ordered set.” 

Note that, in a partially ordered set, it is possible that not all 
elements are comparable. In other words, it is possible to have x,y € P 
with the property that x { y and y ¢ x. Thus, in general, x ¢y is not 
equivalent to y<x. A partially ordered set in which every pair of 
elements is comparable is called a totally ordered set or a linearly 
ordered set. 


Example A.2.1 

1) The set R of real numbers, with the usual binary relation <, is a 
partially ordered set. It is also a totally ordered set. 

2) The set N of natural numbers, together with the binary relation of 
divides, is a partially ordered set. It is customary to write n|m 
(rather than n < m) to indicate that n divides m. 

3) Let S be any set, and let 9(S) be the power set of S, that is, the 
set of all subsets of S. Then #(S), together with the subset relation 
C, is a partially ordered set. 0 


Definition Let P be a partially ordered set. For a,b € P, the (closed) 
interval [a,b] is the set 
[a,b] = {x € Pla<x<b} 


We say that the partially ordered set P is locally finite if every closed 
interval is a finite set. 0 


Notice that, if P is locally finite and contains a zero element 0, then 
the set {x € P|x <a} is finite for all a€ P, for it is the same as the 
interval (0,a]. 


THE INCIDENCE ALGEBRA OF A PARTIALLY ORDERED SET 
Now let P be a locally finite partially ordered set, and let F be a 
field. We set 


A(P) = {f:P x PHF | f(x,y) = 0 if x ¢ y} 
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Addition and scalar multiplication are defined on A(P) by 
(f+g)(y) = f(x,y) + (xy) 
(kf)(x,y) = k[f(x,y)) 


We also define multiplication by 


(feg)(xy)= So £(x,2)g(z,y) 


xozdy 


and 


the sum being finite, since P is assumed to be locally finite. Using these 
definitions, it is not hard to show that A(P) is an algebra, called the 
incidence algebra of P. The identity in this algebra is 


= 1 if x=y 
soo) =f if x#y 


The next theorem characterizes those elements of A(P) that have 
multiplicative inverses. 


Theorem A.2.1 An element f€A(P) is invertible if and only if 
f(x,x) # 0 for all x € P. 
Proof. An inverse g of f must satisfy 
(A.2.1) 5 f(x,z)g(z,y) = (x,y) 
xSzsy 


In particular, for x = y, we get 
f(x,x)g(x,x) = 1 
This shows the necessity and also that g(x,x) must satisfy 


(A.2.2) g(x,x) = Gx) 


Equation (A.2.2) defines g(x,y) when the interval [x,y] has cardinality 
1, that is, when x = y. We can use (A.2.1) to define g(x,y) for intervals 
[x,y] of all cardinalities. 

Suppose that g(x,y) has been defined for all intervals with cardinality 
at most n, and let [x,y] have cardinality n+1. Then, by (A.2.1), since 
x # y, we get 


f(x,x)g(xy)=—- D> £(x,2)g(z,y) 


x<zSy 
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But g(z,y) is defined for z > x since [z,y] has cardinality at most n, and 
so we can use this to define g(x,y). Il 
Definition The function ¢ € A(P), defined by 


_jl if x<y 
¢(x,y) = 0 ifxty 


is called the zeta function. Its inverse p(x,y) is called the Mébius 
function. 0] 

The next result follows from the appropriate definitions. 
Theorem A.2.2 The MGbius function is uniquely determined by any of 
the following conditions. 
1) p(x,x) = 1 and, forx<y, 


XS wey) =0 


xQzy 


2) = (x,x) = 1 and, forx<y, 


» p(x,z) = 0 


x<zsy 
3) pe(x,x) = 1 and, for x <y, 


u(x,y) Fs ~ (zy) 


x<zsy 
4)  p(x,x) = 1 and, forx<y, 


a(xy)= — >> w(x,2) 0 


x<z<y 
Now we come to the main result. 


Theorem A.2.3 (MGbius Inversion) Let P be a locally finite partially 
ordered set with zero element 0. If f and g are functions from P to the 
field F, then 


(A.2.4) a(x)= >> fly) => f(x) = D> gly)u(y,x) 
y<x y<x 
If P is a locally finite partially ordered set with 1, then 


(A.2.5) g(x)= >> fly) => f(x) = D> u(xy)aly) 


xSy xSy 


Proof. Since all sums are finite, we have, for any x, 
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>> aly)u(y,x) = >| fe) po) 
ysx y<xjz<y 
a f(z) u(y,x) 
< x 
< 


a<x z<y< 


=) YO vx) 


Zz<ox z<y<x 
= d f(z)6(z,x) = f(x) 


The rest of the theorem is proved similarly. § 


The formulas (A.2.4) and -(A.2.5) are called M6bius inversion 
formulas. 


Example A.2.2 (Subsets) Let P = 9(S) be the set of all subsets of a 
finite set S, partially ordered by set inclusion. We will use the notation 
C for subset and C for proper subset. (In the text, we use C for 
subset.) The zeta function is 


_j1 if ACB 
¢(A,B) -| 0 otherwise 


The Mobius function p is computed as follows. From Theorem A.2.2, 


we have 
H(A,A) = 1 


and 


w(A,B)=—- >>) p(A,X) 
ACXCB 


So, for x,y,z ¢ A, we have 


H(A,A U {x}) = — p(A,A) = -1 
H(A,A U {x,y}) = — u(A,A) — u(A,A U {x}) — W(A,A U {y}) 
~1+14+1=1 
M(A,A U {x,y,z}) = — w(A,A) — w(A,A U {x}) — n(A,AU {y}) 
— M(A,A U {x,y}) — u(A,A U {x,2}) — (A,A U {y,2}) 
={fi pi pi=feie7 =-1 


It begins to appear that the values of yp alternate between +1 and —1 
and that 
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(-1)!B-Al if ACB 
otherwise 


H(A,B) = a 


To verify this, we have p(A,A) = 1 and, for A CB, 


|B-A| 
>: (-1)/8-Al = S age ANN 1) = 0 


ACXCB k=0 


Now let P,,...,P,, be properties that the elements of a set S may or 
may not possess. For K C {1,...,k}, let E(K) be the number of 
elements of S that have properties P; for i€ K, and no others. Let F(K) 
be the number of elements of S that have at least properties P;, for 
ié K. Then 


F(K)= > E(L) 
KCL 


Hence, by Mobius inversion, 
E(K)= 5) (-1)t-Kl pz) 
KCL 
In particular, if K = @ is the empty set, then 
BO) = 0 (-1)F(L) 
LCS 


But E(@) is the number of elements of S that have none of the 
properties, and so we get 


Number elements with no properties = Dy (-1)* Ss: F({i,,..., ij }) 
k>0 ign cal 


This formula is the well known Principle of Inclusion-Exclusion, which 
we now see is just a special case of Mobius inversion. 


CLASSICAL MOBIUS INVERSION 

Consider the partially ordered set N of natural numbers, ordered by 
division. That is, x is less than or equal to y if and only if x divides y, 
which we will denote by x|y. Notice that the natural number 1 (and 
not 0) is the zero element in this partially ordered set, since 1|n for any 
natural number n. 

In this case, the Mébius function (x,y) depends only on the ratio 
y/x, and is given by 
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1 if {=1 
B(x%Y) = H(z) = (-1)* if }= = P,P2'''P, for distinct primes p; 
0 otherwise 


Notice that the omer wire case can occur if either xy (x does not 
divide y) or if p 21 (y/x) for some prime p. Thus, the value of p(x,y) 
depends on the nature of the prime decomposition of the ratio y/x. 

To verify that this is indeed the Mobius function, we first observe 
that u(x,x) = p(1) = 1. Now let x|y, x #y and 


: = Pips . ‘pia 
where the p; are distinct primes. Then 


~+OM= DL eD= DV eH= YO () (-1)) = 0 


xlaly Elk 1[k ly 1Sisn 


Is 


Now, in the present context, the Mdbius inversion formula becomes 


g(n) = » fk) => f(n)= » a(k)u(F) 
n k|n 


This is the important classical formula, which often goes by the name 
MGbius inversion formula. 0 


MULTIPLICATIVE VERSION OF MOBIUS INVERSION 
We now present a multiplicative version of the Mébius inversion 
formula. 


Theorem A.2.4 Let P be a locally finite partially ordered set with zero 
element 0. If f and g are functions from P to F, then 


ax)=]] tf) = ft =T] leyi"o” 
ysx y<x 


Proof. Since all products are finite, we have, for any x, 


Dl eo’ = TT (0 fat 


y<x y<xjz<y 


=] IT ta” 


ugx u<y<x 
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Y ay.x) 
at II f(z) zSy<x 


Z<x 


= JJ 2°) = tx rN 


Zz<x 


Example A.2.3 Let P =N, and let F be the field of rational functions in 
x. Consider the formula 


m—1= TT AQ) 


k|n 


Then, if we let f(k) = Q,(x) and g(n) = x"—1, Theorem A.2.4 gives 


Q(x) = II (xk ~ pies) = Il (xn/k _ 100) q 


k|n k|n 
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